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M any theories have been advanced over the years to explain 
why people behave as they do. Until recently the most common 
view, popularized by various personality doctrines, depicted be¬ 
havior as impelled by inner forces in the form of needs, drives, and 
impulses, often operating below the level of consciousness. Since the 
principal causes of behavior resided in forces within the individual, 
that is where one looked for explanations of man’s actions. Although 
this view enjoyed widespread professional and popular acceptance, 
it did not go unchallenged. 

Theories of this sort were criticized on both conceptual and em¬ 
pirical grounds. The inner determinants were typically inferred from 
the behavior they supposedly caused, resulting in pseudo explana¬ 
tions. Thus, for example, a hostile impulse was deduced from a 
persons irascible behavior, which was then attributed to the action 
of the underlying impulse. Different personality theories proposed 
diverse lists of motivators, some containing a few all-purpose drives, 
others embracing a vaned assortment of specific drives. 

The conceptual structure of psychodynamic theories was further 
criticized for disregarding the tremendous complexity of human 
responsiveness. An internal motivator cannot possibly account for 
the marked variation in the incidence and strength of a given 
behavior in different situations, toward different persons, at differ¬ 
ent times, and in different social roles. When diverse social influ¬ 
ences produce correspondingly diverse behaviors, the inner cause 
implicated in the relationship cannot be less complex than its effects. 

While the conceptual adequacy of psychodynamic formulations 
was debatable, their empirical limitations could not be ignored 
indefinitely. They provided intriguing interpretations of events that 
had already happened, but they lacked power to predict how people 
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would behave in given situations [Mische! 1968], 
i oreover, it was difficult to demonstrate that persons 
who had undergone psychodynamically oriented 
treatment benefited more than nontreated cases 
[Bandura 1969a, Bergin 1966], Acquiring insight into 
the underlying impulses through which behavioral 
changes were supposedly achieved turned out to 
represent more of a social conversion than a self- 
chacovery process. As Manner [1962], among others, 
pointed out each psychodynamic approach had its 
own favored set of inner causes and its own preferred 
brand of insight. The presence of these determinants 
could be easily confirmed through suggestive prob- 
311(1 selective reinforcement of clients’ verbal 
reports in self-validating interviews. For these rea- 
sons, advocates of differing theoretical orientations 
repeatedly discovered their favorite psychodynamic 
agents but rarely found evidence for the hypothe¬ 
sized causes emphasized by proponents of competing 
views. The content of a particular client’s insights 
and emergent unconscious" could therefore be bet¬ 
ter predicted from knowledge of the therapist’s belief 
than from the client ' s actual social learning 

It evenriiaHy became apparent that if progress in 
understanding human behavior was to be accele¬ 
rated, more stnngent requirements would have to be 

sTsmrns tHe adeiJUaC >' ° f ^Placatory 

systems. Theories must demonstrate predictive 

p°wer, and they must accurately identify causal fac¬ 
ers, as shown by the fact that varying the postulated 
determinants produces related changes in behavior. 

TTie attnbution of behavior to inner forces can 

othtT u T d f ° 6ar 'y cx P lanat °ry schemes in 
other branches of science. At one time diverse chcm 

ica reactions were supposedly caused by movements 

of a material substance called phlogiston physical 

stncf Pr ° pe ' led ** es- 

ces and physiological functioning was ascribed to 

he action of humors. Developments in learning 

ffieoiy shifted the focus of causal analysis from 

ypothesized inner determinants to detailed exami- 

H U °“ ? , eXtema ' uifluences on responsiveness. 

SThe sh r i0r WaS eXtensiveI y an aly*ed in terms 
of the stimulus events that evoke it and the reinforc¬ 
es consequences that alter it. Researchers repeatedly 
demonstrated that response patterns, generally at- 

elinlffiald 0 U !5 derlymg f ° rCeS ' could be induced, 
eliminated, and reinstated simply by varying exter- 

’IZ lZ PS - V '*“ lt f sU ' especially p,opo„e„, s ol 
cal forms of behaviorism, to the view that the 
^ses of behavior are found not in the organism 
but in environmental forces. 6 

The idea that man’s actions are under external 


control, though amply documented, was not enthu¬ 
siastically received for a variety of reasons. To most 
people it unfortunately implied a one-way influence 
process that reduced man to a helpless reactor to the 
vagaries of external rewards and punishments The 
view that behavior is environmentally determined 
also appeared to contradict firm, but ill-founded 
beliefs that people possess .generalized personality’ 
traits leading them to behave in a consistent man 
ner, however variable the social influences. 

A more valid criticism of the extreme behavioristic 
position is that, in a vigorous effort to eschew spu- 
nous inner causes, it neglected determinants of man’s 
behavior arising from his cognitive functioning. Man 
is a thinking organism possessing capabilities that 
provide him with some power of self-direction To 
the extent that traditional behavioral theories could 
be faulted, it was for providing an incomplete rather 
than an inaccurate account of human behavior. 

In the social learning view-, man is neither driven 
by inner forces nor buffeted helplessly by environ¬ 
mental influences. Rather, psychological functioning 
■s best understood in terms of a continuous recipro 
cal interaction between behavior and its controlling 
conditions. The social learning theory outlined in 
this paper places special emphasis on the important 
roles played by vicarious, symbolic, and self-regula¬ 
tory processes, which receive relatively little atten- 
hon even in most contemporary theories of learning. 

hese differences in governing processes carry cer¬ 
tain implications for the way one views the causes 
of human behavior. 

Traditional theories of learning generally depict 
behavror as the product of directly experienced 
response consequences. In actuality, virtually all 
learning phenomena resulting from direct experi¬ 
ences can occur on a vicarious basis through obser¬ 
vation of other people’s behavior and its consequen- 
ces for them. Man’s capacity to learn by observation 
enables him to acquire large, integrated units of 
behavior by example without having to build up 
the patterns gradually by tedious trial and error. 
Similarly, emotional responses can be developed 
observationally by witnessing the affective reactions 
ot others undergoing painful or pleasurable experi¬ 
ences. ]• earful and defensive behavior can be 
extinguished vicariously by observing others engage 
in the feared activities without any adverse conse¬ 
quences. And behavioral inhibitions can be induced 
by seeing others punished for their actions. 

Man’s superior cognitive capacity is another factor 
that determines, not only how he will be affected by 
his experiences, but the future direction his actions 
may take People can represent external influences 
symbolically and later use such representations to 
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guide their actions; they can solve problems symbol¬ 
ically without having to enact the various alterna¬ 
tives; and they can foresee the probable consequences 
of different actions and alter their behavior accord¬ 
ingly. These higher mental processes permit both 
insightful and foresightful behavior. 

A third distinguishing feature of man is that he is 
capable of creating self-regulative influences. By 
managing the stimulus determinants of given activi¬ 
ties and producing consequences for their own 
actions, people are able to control their own behavior 
to some degree. As illustrated later, cognitive and 
self-regulative influences often serve important func¬ 
tions in causal sequences. The remainder of this 
paper is devoted to a detailed social learning analysis 
of how patterns of behavior are acquired and how 
their expression is continuously regulated by the 
interplay of self-generated and other sources of 
influences. 

Learning by Direct Experience 

In the social learning system, new patterns of be¬ 
havior can be acquired through direct experience or 
by observing the behavior of others. The more rudi¬ 
mentary form of learning, rooted in direct experi¬ 
ence, is largely governed by the rewarding and 
punishing consequences that follow any given action. 
People are repeatedy confronted with situations with 
which they must deal in one way or another. Some 
of the responses that they tiy prove unsuccessful, 
while others produce more favorable effects. Through 
this process of differential reinforcement successful 
modes of behavior are eventually selected from 
exploratory activities, while ineffectual ones are 
discarded. 

It is commonly believed that responses are auto¬ 
matically and unconsciously strengthened by their 
immediate consequences. Simple performances can 
be altered to some degree through reinforcement 
without awareness of the relationship between one’s 
actions and their outcomes. However, man s cogni¬ 
tive skills enable him to profit more extensively from 
experience than if he were an unthinking organism. 
Within the framework of social learning theory, 
reinforcement primarily serves informative and in¬ 
centive functions, although it also has response- 
strengthening capabilities. 

Informative Function of Reinforcement 

During the course of learning, people not only 
perform responses, but they also observe the differ¬ 
ential consequences accompanying their various 
actions. On the basis of this informative feedback, 


they develop thoughts or hypotheses about the 
types of behavior most likely to succeed. These 
hypotheses then serve as guides for future actions 
[Dulany & O'Connell 1963]. Accurate hypotheses 
give rise to successful performances, whereas erro¬ 
neous ones lead to ineffective courses of action. The 
cognitive events are thus selectively strengthened or 
disconfirmed by the differential consequences ac¬ 
companying the more distally occurring overt be¬ 
havior. In this analysis of learning by experience, 
reinforcing consequences partly serve as an unarticu- 
lated way of informing performers what they must 
do in order to gain beneficial outcomes or to avoid 
punishing ones, 

Motivational Function of Reinforcement 

Because of man’s anticipatory capacity, conditions 
of reinforcement also have strong incentive-motiva¬ 
tional effects. Most human behavior is not controlled 
by immediate external reinforcement. As a result of 
prior experiences, people come to expect that certain 
actions will gain them outcomes they value, others 
will have no appreciable effects, and still others will 
produce undesired results. Actions are therefore 
regulated to a large extent by anticipated consequen¬ 
ces. Homeowners, for instance, do not wait until they 
experience the misery of a burning house to buy fire 
insurance; people who venture outdoors do not 
ordinarily wait until discomforted by a torrential 
rain or a biting snowstorm to decide what to wear; 
nor do motorists usually wait until inconvenienced 
by a stalled automobile to replenish gasoline. 

Through the capacity to represent actual outcomes 
symbolically, future consequences can be converted 
into current motivators that influence behavior in 
much the same way as actual consequences. Man’s 
cognitive skills thus provide him with the capabiity 
for both insightful and foresightful behavior. 

Cognitive Mediation of Reinforcement Effects 

A great deal of research has been conducted on 
whether behavior is learned through the automatic 
action of consequences or whether the effects of 
reinforcement are cognitively mediated. Most of 
these studies have employed verbal conditioning 
situations in which subjects converse freely or con¬ 
struct sentences and the experimenter rewards cer¬ 
tain classes of words but ignores all others. Changes 
in the incidence of reinforced verbalizations are then 
examined as a function of whether the participants 
were aware that their verbal utterances were selec¬ 
tively reinforced and whether they recognized the 
types of words that produced reinforcement. 
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Spielberger and De Nike [1966], proponents of an 
exclusively cognitive view of learning, measured 
subjects awareness at periodic intervals throughout 
the verbal conditioning session. They found that 
subjects displayed no rise in the number of rein¬ 
forced responses as long as they remained unaware 
of the reinforcement contingency, but they suddenly 
increased their output of appropriate responses upon 
discovering what types of words were rewarded. 
Dulany [1962, 1968] similarly found that reinforcing 
consequences were ineffective in modifying subjects’ 
behavior as long as they were unaware of what they 
had to do to produce rewarding outcomes. These 
authors concluded that learning cannot take place 
without awareness of what is being reinforced. 
Dulany, who outlined this position in a formal way, 
contended that the acquired response information 
gives rise to intentions or self-instructions to pro¬ 
duce the required behaviors, the strength of the 
tendency depending on the incentive value of the 
consequences. Neither these findings nor the general¬ 
izations went un contested. 

Earlier studies by Postman and Sassenrath [1961], 
who also examined the temporal relation between 
emergence of awareness and changes in responsive¬ 
ness, found that reinforcement produced small but 
significant improvements in performance prior to 
awareness, but subjects sharply increased appropri¬ 
ate responses after they hit upon the correct solution. 
Learning, they concluded, can occur without aware¬ 
ness, albeit slowly and quite inefficiently. The re¬ 
sultant increase in correct responses makes it easier 
to discern what is wanted; once the discovery has 
been made, the appropriate behavior is readily per¬ 
formed, given adequate incentives. A sudden rise in 
correct responding with discovery of the reinforce¬ 
ment contingencies is generally regarded as insight¬ 
ful behavior. 

The discrepant findings concerning the relation¬ 
ship between awareness and behavior change largely 
reflected how adequately awareness was measured. 
Spielberger and De Nike [1966] assessed at infre¬ 
quent intervals during the learning task (after 
twenty-five trials) whether subjects figured out the 
correct responses. Subjects may, therefore, have dis¬ 
cerned the reinforcement contingency late in the 
insight block of trials after a noticeable increase in 
performance was produced by reinforcement in the 
absence of awareness. By measuring awareness at 
shorter intervals, Kennedy [1970, 1971] demonstrated 
that this is precisely what happens. When data are 
plotted in twenty-five-trial blocks, awareness seems 
to precede behavior change, but when these same 
data are plotted in five- or ten-trial blocks, perform¬ 


ance gains precede awareness for subjects who later 
recognize the correct types of responses. 

Whether emergence of awareness is accompanied 
by increases, decreases, or no change in performance 
largely depends on the reward value of the response 
consequences. People who are aware of appropriate 
responses in a given situation and who value the 
outcomes they produce change their behavior in the 
reinforced direction. On the other hand, those who 
are equally aware of the reinforcement contingencies 
but who devalue either the required behavior or the 
reinforcers not only remain uninfluenced but may 
even respond in an oppositional manner. 

Reinforcing Effects of Response Consequences 

Verbal conditioning procedures are adequate for 
demonstrating that awareness can facilitate changes 
in behavior, but they are ill-suited for resolving the 
basic question of whether awareness is a prerequisite 
for learning or performance change. The limitations 
are twofold: first, the verbalizations that are modified 
already exist in the subjects’ repertoires-sp that the 
task does not involve response learning. Second, since 
both the responses and their outcomes are observ¬ 
able, experimenters have to rely on subjects' verbal 
reports of unknown reliability to figure out who is 
aware of what. 

The question of whether learning must be con¬ 
sciously mediated is answered most decisively by 
studies in which either the reinforced responses or 
their effects are not attributable to awareness be¬ 
cause the action-outcome relationship cannot be 
observed. Hefferline and his associates [Hefferline 
& Keenan 1963, Hefferline, Keenan, & Harford 1959] 
and Sasmor [1966] successfully modified the be¬ 
havior of adults through reinforcement even though 
the latter were unable to observe their rewarded 
responses. In these studies the occurrence of visibly 
imperceptible muscular responses, detected by the 
experimenter through electronic amplification, is 
reinforced either by monetary reward or by termina¬ 
tion of unpleasant stimt/lation. Unseen responses in¬ 
crease substantially during reinforcement and de¬ 
cline abruptly after reinforcement is withdrawn. As 
might be expected, none of the people could identify 
the response that produced the reinforcing conse¬ 
quences. 

In the preceding investigation, the appropriate 
responses were unobservable but the reinforcers were. 
Experiments have been conducted with animals in 
which the correct responses are noticeable hut their 
reinforcing consequences are not [Chambers 1956, 
Coppock & Chambers 1954, Coppock, Headlee, & 
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Hood 1953]. Results of such studies likewise show 
that responses can be automatically strengthened 
through selective reinforcement operating below the 
level of awareness. 

Evidence that elementary performances can be 
increased through reinforcement without die me¬ 
diating effects of awareness does not mean that peo¬ 
ple can learn to respond in accordance with rela¬ 
tively complicated principles in this manner as well, 
for an illustration of how rule-governed behavior is 
under cognitive control, visualize a task in which 
subjects are presented with words varying in length, 
to which they must provide a correct number. Let 
us select an arbitrary rule wherein the correct num¬ 
ber is obtained by subtracting the number of letters 
in a given word from 97, dividing the remainder by 
3 ; a* 1d then multiplying the result by 5. In this situa¬ 
tion, appropriate responses are derived from a high- 
order rule, 

R = 97 - N X 5, 

3 

requiring a three-step transformation of the external 
stimulus. To achieve errorless performances, one 
must perform accurately several mental operations in 
a particular sequence. An unthinking organism is 
unlikely to show any performance gains, however 
long its responses may be reinforced. 

The overall evidence reveals that response con¬ 
sequences can be informative, motivating, and rein¬ 
forcing. Therefore, in any given instance, contingent 
reinforcement may produce changes in behavior 
through any one or more of the three processes. Peo¬ 
ple can learn some patterns of behavior by experi¬ 
encing rewarding and punishing consequences, but 
if they know what they are supposed to do to secure 
desired outcomes, they profit much more from such 
experiences. As shown later, while reinforcement is 
a powerful method for regulating behaviors that 
have already been learned, it is a relatively inefficient 
way of creating them. 


Learning through Modeling 

Although behavior can be shaped into new pat¬ 
terns to some extent by rewarding and punishing 
consequences, learning would be exceedingly labori¬ 
ous and hazardous if it proceeded solely on this 
basis. Environments are loaded with potentially 
lethal consequences that befall those who are un¬ 
fortunate enough to perform dangerous errors. For 
this reason it would be ill-advised to rely on differen¬ 


tial reinforcement of trial-and-error performances i 
teaching children to swim, adolescents to drive autc 
mobiles, and adults to develop complex occupations 
and social competencies. Apart from questions c 
survival, it is difficult to imagine a socialization prc 
cess in which the language, mores, vocational activi 
ties, familial customs, and the educational, religious 
and political practices of a culture are taught to eac) 
new member by selective reinforcement of fortui 
tous behaviors, without benefit of models who exem 
plify the cultural patterns in their own behavior. 

Most of the behaviors that people display ar- 
learned, either deliberately or inadvertently, througl 
the influence of example. There are several reason 
why modeling influences figure prominently in hu 
man learning in everyday life. When mistakes ar< 
costly or dangerous, new modes of response can b< 
developed without needless errors by providing com 
petent models who demonstrate how the requirec 
activities should be performed. Some complex be 
haviors, of course, can be produced .only through th< 
influence of models. If children had no'opportunib 
to hear speech, for example, it would be virtual!) 
impossible to teach them the linguistic skills tha 
constitute a language. It is doubtful that one coulc 
ever shape intricate individual words, let alonf 
grammatical speech, by differential reinforcement oi 
random vocalizations. Where novel forms of behavioi 
can be conveyed only by social cues, modeling is an 
indispensable aspect of learning. Even in instances 
where it is possible to establish new response pat¬ 
terns through other means, the process of acquisition 
can be considerably shortened by providing appro¬ 
priate models. Under most circumstances, a good 
example is therefore a much better teacher than the 
consequences of unguided actions. 

A number of different theories have been pro¬ 
posed over the years to explain how people learn by 
observing the behavior of others. Major tenets of 
these alternative views, and their empirical status, 
are discussed at length elsewhere [Bandura 1971b]. 
The social learning analysis of observational learn¬ 
ing differs from contemporary learning interpreta¬ 
tions principally in the locus of response integration, 
in the role played by cognitive functions, and in the 
manner in which reinforcement influences observa¬ 
tional learning. 

Nonmediated Stimulus-Response Theories 

There was no research to speak of on modeling 
processes until the classic publication Social Learn¬ 
ing and Imitation by Miller and Dollard [1941]. 
They advanced the view that in order for imitative 
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learning to occur observers must be motivated to act, 
they must be provided with an example of the de¬ 
sired behavior, they must perform responses that 
match the example, and their imitative behavior 
must be positively reinforced. In experiments con¬ 
ducted to test these assumptions, a model always 
chose one of two boxes that contained two rewards, 
and observers received one of the rewards whenever 
they went to the same box. Observers not only 
learned to follow the model, but they generalized 
eopying responses to new situations, to new models, 
and to different motivational states. 

These experiments were widely accepted as demon¬ 
strations of imitative learning, although they repre¬ 
sent only a special form of discrimination place- 
learning. The model’s actions simply informed chil¬ 
dren where to go but did not teach them any new 
behavior. Had a light or some other informative cue 
been used to indicate the correct box, the model's 
behavior would have been irrelevant and perhaps 
even a hindrance to efficient performance. Observa¬ 
tional response learning, on the other hand, is con¬ 
cerned with how people organize behavioral ele¬ 
ments to form new response patterns exemplified by 
others. Since the theory proposed by Miller and 
Dollard requires a person to display imitative 
responses before he can learn them, it accounts 
more adequately for the performance of previously 
established matching responses than for their acqui¬ 
sition. It is perhaps for this reason that publication 
of their research monographs, which contained many 
provocative ideas, aroused only a passing interest. 
Despite the prevalence and powerful influence of 
example in the development and regulation of hu¬ 
man behavior, traditional accounts of learning con¬ 
tain little or no mention of modeling processes. 

The operant conditioning analysis of modeling 
[Baer & Sherman 1964, Skinner 1953, Gewirtz & 

Stingle 1968] relies entirely upon the standard three- 
component paradigm S d — R— -S', where S*> 
denotes the modeled stimulus, R represents an overt 
matching response, and S r designates the reinforcing 
stimulus. Except for deletion of the motivational re¬ 
quirement, Skinner’s explanation posits the same 
necessary conditions (that is, cue, response, rein¬ 
forcement) for imitation originally proposed by 
Miller and Dollard. Observational learning, accord¬ 
ing to the operant view, is presumably achieved 
through differential reinforcement. When responses 
corresponding to the model’s actions are positively 
reinforced and divergent responses either nonre- 
warded or punished, the behavior of others comes 
to function as cues for matching responses. 

The scheme above does not appear to be appli¬ 


cable to observational learning where an observer 
does not overtly perform the model's responses in 
the setting in which they are exhibited, reinforce¬ 
ments are not administered either to the model or to 
the observer, and whatever responses have been 
thus acquired are not displayed for days, weeks, or 
even months. Under these conditions, which repre¬ 
sent one of the most prevalent forms of social learn¬ 
ing, two of the factors (R —»S r ) in the three-element 
paradigm are absent during acquisition, and the 
third factor (S d , or modeling stimulus) is typically 
missing from the situation when the observationally 
learned response is first performed. Like the Miller 
and Dollard theory, Skinner's analysis clarifies 'how 
similar behavior that a person has previously leauned 
can be prompted by the actions of others and the 
prospect of reward However, it does not explain 
how a new matching response is acquired obseiva- 
tionally in the first place. As shown later, such 
learning occurs through symbolic processes during 
exposure to the modeled activities before any re¬ 
sponses have been performed or reinforced. 

Social Learning Analysis of 
Observational Learning 

Social learning theory assumes that modeling in¬ 
fluences produce learning principally through their 
informative functions and that observers acquire 
mainly symbolic representations of modeled activities 
rather than specific stimulus-response associations 
[Bandura 1969a, 1971a]. In this formulation, model¬ 
ing phenomena are governed by four interrelated 
subprocesses. 

Attentional processes. A person cannot learn 
much by observation if he does not attend to, or 
recognize, the essential features of the model’s be¬ 
havior. One of the component functions in learning 
by example is therefore concerned with attentional 
processes. Simply exposing persons to models does 
not in itself ensure that they will attend closely to 
them, that they will necessarily select from the 
models numerous characteristics the most relevant 
ones, or that they will even perceive accurately the 
aspects they happen to notice. 

Among the numerous factors that determine ob¬ 
servational experiences, associational preferences are 
undoubtedly of major importance. The people with 
whom one regularly associates delimit the types of 
behavior that one will repeatedly observe and hence 
learn most thoroughly. Opportunities for learning 
aggressive behavior obviously differ markedly for 
members of delinquent gangs and of Quaker groups. 
Within any social group some members are likely 
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to command greater attention than others. The func¬ 
tional value of the behaviors displayed by different 
models is highly influential in deter minin g which 
models will be closely observed and which will be 
ignored. Attention to models is also channeled by 
their interpersonal attraction. Models who possess 
interesting and winsome qualities are sought out, 
whereas those who lack pleasing characteristics tend 
to be ignored or rejected, even though they may 
excel in other ways. 

Some forms of modeling are so intrinsically re¬ 
warding that they can hold the attention of people 
of all ages for extended periods. This is nowhere 
better illustrated than in televised modeling. Indeed, 
models presented in televised form are so effective 
in capturing attention that viewers leam the depicted 
behavior regardless of whether or not they are given 
extra incentives to do so [Bandura, Grusec, k Men- 
love 1966], 

Retention processes. A person cannot be much 
influenced by observation of a model's behavior if 
he has no memory of it. A second major function 
involved in observational learning concerns long¬ 
term retention of activities that have been modeled 
at one time or another. If one is to reproduce a 
model’s behavior when the latter is no longer present 
to serve as a guide, the response patterns must be 
represented in memory in symbolic form. By this 
means past influences can achieve some degree of 
permanence. 

Observational learning involves two representa¬ 
tional systems—an imaginal and a verbal one. During 
exposure, modeling stimuli produce, through a pro¬ 
cess of sensory conditioning, relatively enduring, 
retrievable images of modeled sequences of be¬ 
havior. Indeed, under conditions where stimulus 
events are highly correlated, as when a name is 
consistently associated with a given person, it is vir¬ 
tually impossible to hear the name without experi¬ 
encing imagery of the person's physical characteristics. 
Similarly, reference to activities (golfing, siding), 
places (San Francisco, Paris), and things (one's au¬ 
tomobile, Washington Monument) that one has pre¬ 
viously observed immediately elicits vivid imaginal 
representations of the absent physical stimuli. 

The second representational system, which prob¬ 
ably accounts for the notable speed of observational 
learning and long-term retention of modeled con¬ 
tents by humans, involves verbal coding of obseiwed 
events. Most of the cognitive processes that regulate 
behavior are primarily verbal rather than visual. The 
route traversed by a model can be acquired, retained, 
and later reproduced more accurately by verbal 
coding of the visual information into a sequence of 


right and left turns (for example, RLRRL) than b' 
reliance upon visual imagery of the itinerary. Ob 
servational learning and retention are facilitated b' 
such codes because they carry a great deal of in 
formation in an easily stored form. 

After modeled activities have been transformer 
into images and readily utilizable verbal symbols 
these memory codes serve as guides for subsequen 
reproduction of matching responses. That symbolic 
coding can enhance observational learning is showi 
by studies conducted both with children [Bandura 
Grusec, & Menlove 1966, Coates k Hartup 1969] anc 
with adults [Bandura & Jeffery 1971, Gen>t N 1971] 
Observers who code modeled activities into eithei 
words, concise labels, or vivid imagery leam anc 
retain the behavior better than those who simpl) 
observe or are mentally preoccupied with othei 
matters while watching the performance of others 

In addition to symbolic coding, rehearsal serves a< 
an important memory aid. People who mentally re¬ 
hearse or actually perform modeled patterns of be¬ 
havior are less likely to forget them than are those 
who neither think about nor practice what the) 
have seen. Some of the behaviors that are learned 
observationally cannot be easily strengthened by 
overt enactment either because they are socially pro¬ 
hibited or because the necessary appartus is lacking. 
It is therefore of considerable interest that mental 
rehearsal of modeled activities can increase their 
retention [Bandura k Jeffery 1971, Michael k Mac- 
coby 1961]. 

Some researchers [Gewirtz k Stingle 1968] have 
been especially concerned about the conditions that 
produce the first imitative responses on the assump¬ 
tion that they help explain observational learning at 
later stages of development. There Is some reason to 
question whether initial and later imitations have 
equivalent determinants. In early years imitative 
responses are evoked directly and immediately by 
models actions. In later periods, imitative responses 
are usually performed in the absence of the models 
long after their behavior has been observed. Imme¬ 
diate imitation does not require much in the way of 
cognitive functioning because the behavioral repro¬ 
duction is externally guided by the model's actions. 
By contrast, in delayed imitation, the absent modeled 
events must be internally represented, and factors 
such as symbolic transformation and cognitive organ¬ 
ization of modeling stimuli and covert rehearsal, 
which facilitate retention of acquired contents, serve 
as determinants of observational learning. The differ¬ 
ence between physically prompted and delayed imi¬ 
tation is analogous to drawing a picture of one's 
automobile when it is at hand, and from memory. 
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In the latter situation, the hand does not automati¬ 
cally sketch the car; rather, one must rely on memory 
guides, mainly in the form of mental images. 

Motoric reproduction processes. The third com¬ 
ponent of modeling is concerned with processes 
whereby symbolic representations guide overt ac¬ 
tions. To achieve behavioral reproduction, a learner 
must put together a given set of responses according 
to the modeled patterns. The amount of observa¬ 
tional learning that a person can exhibit behavioraUy 
depends on whether or not he has acquired the 
component skills. If he possesses the constituent 
elements, he can easily integrate them to produce 
new patterns of behavior, but if the response com¬ 
ponents are lacking, behavioral reproduction will be 
faulty. Given extensive deficits, the subskills required 
for complex performances must first be developed by 
modeling and practice. 

Even though symbolic representations of modeled 
activities are acquired and retained, and the sub¬ 
skills exist, an individual may be unable to coordi¬ 
nate various actions in the required pattern and se- 
quence because of physical limitations. A young 
child can leam observationally the behavior for 
drivmg an automobile and be adept at executing 
the component responses, but if he is too short to 
operate the controls he cannot maneuver the ve- 
hide successfully. 

There is a third impediment at the behavioral level 
to skillful reproduction of modeled activities that 
have been learned observationally. In most coordi¬ 
nated motor skills, such as golf and swimming, 
performers cannot see the responses that they are 
making; hence, they must rely on ill-defined pro¬ 
prioceptive cues or verbal reports of onlookers. It is 
exceedingly difficult to guide actions that are not 
easily observed or to identify the corrective adjust¬ 
ments needed to achieve a close match of symbolic 
model and overt performance. In most everyday 
learning, people usually achieve rough approxima- 
bons of new patterns of behavior by modeling and 
refine them through self-corrective adjustments on 
the basis of informative feedback from performance. 
nemforcement and motivational processes. A per- 

acqmre ' retaii1 ' and possess tlie capabilities for 
skillful execution of modeled behavior, but the learn¬ 
ing may rarely be activated into overt performance 
it is negatively sanctioned or otherwise unfavor¬ 
ably received. When positive incentives are provi¬ 
ded, observational learning, which previously re¬ 
mained unexpressed, is promptly translated into 
action [Bandura 1965], Reinforcement influences not 
only regulate the overt expression of matching be¬ 


havior, but they can affect the level of observational 
learning by controlling what people attend to and 
how actively they code and rehearse what they 
have seen. 7 

For reasons given above, the provision of models, 
even prominent ones, will not automatically create 
similar patterns of behavior in others. If one is in¬ 
terested merely in producing imitative behavior, 
some of the subprocesses included in the social 
learning analysis of modeling can be disregarded. A 
model who repeatedly demonstrates desired respon¬ 
ses, instructs others to reproduce them, physically 
prompts the behavior when it fails to occur, and then 
administers powerful rewards will eventually elicit 
matching responses in most people. It may require 
1, 10, or 100 demonstration trials, but if one persists, 
the desired behavior will eventually be evoked. If, on 
the other hand, one wishes to explain why modeling 
does or does not occur, a variety of determinants 
must be considered. In any given instance lack of 
matching behavior following exposure to modeling 
influences may result from either failure to observe 
the relevant activities, inadequate coding of modeled 
events for memory representation, retention decre¬ 
ments, motoric deficiencies, or inadequate conditions 
of reinforcement. 


Locus of Response Integration in Observational 

Learning 

New patterns of behavior are created by organiz¬ 
ing constituent responses into certain patterns and 
sequences. Theories of modeling differ on whether 
the response integration occurs mainly at central 
or at peripheral levels. Operant conditioning formu¬ 
lations [Baer & Sherman 1964, Gewirtz & Stingle 
1968] assume that response elements are selected 
out of overt performances by providing appropriate 
antecedent stimuli and by rewarding actions that 
resemble the modeled behavior and ignoring those 
that do not. The response components presumably 
thus extracted are sequentially chained by the influ¬ 
ence of reinforcement to form more complex units of 
behavior. Since, in this view, behavior is organized 
into new patterns in the course of performance 
learning requires overt responding and immediate 
reinforcement. 

According to social learning theory, behavior is 
learned, at least in rough form, before it is per¬ 
formed. By observing a model of the desired be¬ 
havior, an individual forms an idea of how response 
components must be combined and temporally se¬ 
quenced to produce new behavioral configurations. 
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The representation serves as a guide for behavioral 
reproduction. Observational learning without per¬ 
formance is abundantly documented in modeling 
studies using a nonresponse acquisition procedure 
[Bandura 1969, Flanders 1968]. After watching 
models perform novel modes of response, observers 
can later describe the entire pattern of behavior with 
considerable accuracy, and given the appropriate 
conditions, they often achieve errorless behavioral 
reproductions on the first test trial. 

It is commonly believed that controversies about 
the locus of learning cannot be satisfactorily resolved 
because learning must be inferred from performance. 
This may very well be the case in experimentation 
with animals. To determine whether a rat has mas¬ 
tered a maze one must run him through it. With 
humans, there exists a reasonably accurate index of 
learning that is independent of motor performance. 
To measure whether a human has learned a maze by 
observing the successful performances of a model, 
one need only ask him to describe the correct pat¬ 
tern of right and left turns. Such an experiment 
would undoubtedly reveal that people can learn 
through modeling before they perform. 

Role of Reinforcement in Observational Learning 

Another issue in contention concerns the role of 
reinforcement in observational learning. As pre¬ 
viously noted, reinforcement-oriented theories [Baer 
& Sherman 1964, Miller & Dollard 1941, Gewirtz & 
Stingle 1968] assume that imitative responses must 
be reinforced in order to be learned. Social learning 
theory, on the other hand, distinguishes between 
learning and performance of matching behavior. 
Observational learning, in this view, can occur 
through observation of modeled behavior and ac¬ 
companying cognitive activities without extrinsic 
reinforcement. This is not to say that mere exposure 
to modeled activities is, in itself, sufficient to produce 
observational learning. Not all stimulation that im- 
pinges on individuals is necessarily observed by 
them, and even if attended to, the influence of model- 

Reinforcement Theories 
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ing stimuli alone does not ensure that they will be 
retained for any length of time. 

Anticipation of reinforcement is one of several fac¬ 
tors that can influence what is observed and what 
goes unnoticed. Knowing that a given model's be¬ 
havior is effective in producing valued rewards or 
averting negative consequences can enhance observa¬ 
tional learning by increasing observers’ attentiveness 
to .the model’s actions. Moreover, anticipated rein¬ 
forcement can strengthen retention of what has been 
learned observationally by motivating people to code 
and to rehearse modeled responses that have high 
value. Theories of modeling primarily differ in the 
manner in which reinforcement influences observa¬ 
tional learning rather than in whether reinforcement 
may play a role in the acquisition process. As shown 
in the schematization below, the issue in dispute is 
whether reinforcement acts backward to strengthen 
preceding imitative responses and their association to 
stimuli or whether it facilitates learning through its 
effects on attentional, organizational, and rehearsal 
processes. It would follow from social learning theory 
that a higher level of observational learning would 
be achieved by informing observers in advance about 
the payoff value for adopting modeled patterns of 
behavior than by waiting until observers happen to 
imitate a model and then rewarding them for it. 

In social learning theory reinforcement is con¬ 
sidered a facilitate rather than a necessary condi¬ 
tion because there are factors other than response 
consequences that can influence what people will 
attend to. One does not have to be reinforced, for 
example, to hear compelling sounds ‘or to look at 
prominent visual displays. Hence, when people’s 
attention to modeled activities can be gained through 
physical means, the addition of positive incentives 
does not increase observational learning [Bandura, 
Grusec, & Menlove 1966]. Children who intently 
watched modeled actions on a television screen in a 
room darkened to eliminate distractions later dis¬ 
played the same amount of imitative learning re¬ 
gardless of whether they were informed in advance 
that correct imitations would be rewarded or given 
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no prior incentives to learn the modeled perform* 
ances. Anticipated reinforcement would be expected 
to exert greatest influence on observational learning 
under self-selection conditions where people can 
choose whom they will attend to and how intensively 
they observe their behavior. 

Both operant conditioning and social learning 
theories assume that whether or not people choose to 
perform what they have learned observationally is 
strongly influenced by the consequences of such 
actions. In social learning theory, however, behavior 
is regulated, not only by directly experienced con¬ 
sequences from external sources, but by vicarious 
reinforcement and self-reinforcement. 

The Modeling Process and Transmission of 
Response Information 

A major function of modeling stimuli is to transmit 
information to observers on how to organize com¬ 
ponent responses into new patterns of behavior. 
This response information can be conveyed through 
physical demonstrations, through pictorial represen¬ 
tation, or through verbal description. 

Much social learning occurs on the basis of casual 
or studied observation of exemplary models. As lin¬ 
guistic skills are developed, verbal modeling is 
gradually substituted for behavioral modeling as the 
preferred mode of response guidance. By performing 
sequences of actions as described in instructional 
manuals, people can learn how to assemble and 
operate complicated mechanical equipment, how to 
behave in a variety of unfamiliar social situations, 
and how to perform vocational and recreational tasks 
in a skillful manner. Verbal modeling is used exten¬ 
sively because one can convey through words an 
almost infinite variety of complex behaviors that 
would be exceedingly difficult and time consuming to 
portray behaviorally. 

Another influential source of social learning is the 
abundant and varied symbolic modeling provided in 
television, films, and other pictorial displays. There 
is a large body of research evidence showing that 
both children and adults can acquire attitudes, emo¬ 
tional responses, and new patterns of behavior as 
a result of observing filmed or televised models 
[Bandura 1969a, Flanders 1968, Lumsdaine 1961]. 
Considering the large amount of time that people 
spend watching televised models, mass media may 
play an influential role in shaping behavior and so¬ 
cial attitudes. With further developments in commu¬ 
nication technology, it will be possible to have almost 
any activity portrayed on request at any time on re¬ 
mote television consoles [Parker 1970]. As such forms 


of symbolic modeling are increasingly used, parents, 
teachers, and other traditional role models may as¬ 
sume a less prominent role in social learning. 

The basic modeling process is the same regardless 
of whether the desired behavior is conveyed through 
words, pictures, or live actions. Different forms of 
modeling, however, are not always equally effective. 
It is frequendy difficult to convey through words the 
same amount of information contained in pictorial or 
live demonstrations. Some forms of modeling may 
also be more powerful than others in commanding 
attention. Children, or adults for that matter, rarely 
have to be compelled to watch television, whereas 
verbal characterizations of the same activities would 
fail to hold their attention for long. One might also 
expect observers who lack conceptual skills to bene¬ 
fit less from verbal modeling than from behavioral 
demonstrations. 

Scope of Modeling Influences 

In many instances the behavior displayed by exem¬ 
plary models must be learned in essentially the same 
form. For example, driving automobiles or perform- 
ing surgical operations permit little, if any, depar¬ 
ture from established practices. In addition to trans¬ 
mitting fixed repertoires of behavior, modeling in¬ 
fluences can, contrary to common belief, create gen¬ 
erative and innovative behavior as well. 

In studying more complex forms of modeling, per¬ 
sons observe models respond to different stimuli in ac¬ 
cordance with a preselected rule or principle. Observ¬ 
ers are subsequently tested under conditions where 
they can behave in a way that is stylistically similar 
to the model's disposition, but they cannot mimic his 
specific responses. To take an example, a model 
constructs from a set of nouns sentences containing 
the passive voice. Children are later instructed to 
generate sentences from a different set of nouns with 
the model absent, and the incidence of passive con¬ 
structions is recorded. In this higher form of model¬ 
ing, observers must abstract common features exem¬ 
plified in diverse modeled responses and formulate 
a rule for generating similar patterns of behavior. 
Responses performed by subjects that embody the 
observationally derived rule are likely to resemble 
the behavior that the model would be inclined to 
exhibit under similar circumstances, even though 
observers had never witnessed the model's behavior 
in these.new situations. 

A number of studies have been conducted demon¬ 
strating how response-generative rules can be trans¬ 
mitted through modeling. Young children who had 
no formal grammatical knowledge altered their 
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syntactic style in accord with the rules guiding the 
modeled verbal constructions [Bandura & Harris 
1966, Liebert, Odom, Hill, & Huff 1969, Rosenthal 
& Whitebook 1970]. In addition, modeling influ¬ 
ences have been successful in modifying moral judg¬ 
mental orientations [Bandura & McDonald 1963, 
Cowan, Langer, Heavenrich, & Nathanson 1969], 
delay of gratification patterns [Bandura & Mischel 
1965, Stumphauzer 1969], and styles of information 
seeking [Rosenthal, Zimmerman, & Duming 1970]. 
Researchers have also begun to study how modeling 
influences alter cognitive functioning of the type 
described by Piaget and his followers [Rosenthal & 
Zimmerman 1970, Sullivan 1967]. 

The broader effects of modeling are further re¬ 
vealed in studies employing several models who 
display different patterns of behavior. Observers may 
select one or more of the models as the primary 
source of behavior, but they rarely restrict their 
imitation to a single source, nor do they adopt all of 
the characteristics of the preferred model. Rather, 
observers generally exhibit relatively novel responses 
representing amalgams of elements from different 
models [Bandura, Ross, & Ross 1963a]. Paradoxical 
as it may seem, innovative patterns can emerge solely 
through modeling. Thus, within a given family same- 
sex siblings may develop distinct personality charac¬ 
teristics as a result of adopting different combinations 
of parental and sibling attributes. A succession of 
modeling influences, in which observers later be¬ 
come sources of behavior for new members, would 
most likely produce a gradual imitative evolution of 
novel patterns that might bear little resemblance to 
those exhibited by the original models. In homoge¬ 
neous cultures, where all models display similar 
modes of response, imitative behavior may undergo 
little or no change across successive models. It is 
diversity in modeling that fosters behavioral inno¬ 
vation. 

The discussion thus far has been concerned solely 
with the process of learning through modeling. A 
second major function of modeling influences is to 
strengthen or to weaken inhibitions of responses that 
observers have previously learned [Bandura 1971b]. 
The effects that models have on behavioral restraints 
are largely determined by observation of rewarding 
and punishing consequences accompanying models' 
responses. As a result of seeing a model’s actions 
punished, observers tend to inhibit behaving in a 
similar way. Conversely, observing models engage 
in threatening or prohibited activities without ex¬ 
periencing any adverse consequences can reduce 
inhibitions in observers. Such disinhibitory effects 
are most strikingly revealed in recent therapeutic 


applications of modeling principles [Bandura 1971a]. 
In these studies, people who dread and avoid certain 
activities are able to perform them in varying degrees 
after observing others perform the feared behaviors 
repeatedly without any harmful effects. 

The actions of others can also serve as social cues 
that influence how others will behave at any given 
time. Response facilitation by modeling can be dis¬ 
tinguished from observational learning and disinhibi- 
tion by the fact that the models actions neither teach 
new behaviors nor reduce inhibitions, because the 
behavior in question, which already exists, is socially 
sanctioned and therefore is unencumbered by re¬ 
straints. Inhibitory and disinhibitory effects of model¬ 
ing are examined later in the context of vicarious 
reinforcement, while social facilitation effects are 
given detailed consideration in the discussion of 
stimulus control of behavior. 

Modeling influences can have additional effects, 
though these may be of lesser importance. The be¬ 
havior of models directs observers’ attention to the 
particular objects used by the performer. As a re¬ 
sult, observers may subsequently use the same 
objects to a greater extent, though not necessarily 
in an imitative way. In one experiment, for example, 
children who had observed a model pummel a large 
doll with a mallet not only imitated this specific 
aggressive action but spent more time pounding 
other things with a mallet than did those who did 
not see a person handle this particular instrument 
[Bandura 1962]. Research findings, considered to¬ 
gether, disclose that modeling influences can serve 
as teachers, as inhibitors, as disirihibitors, as response 
elicitors, as stimulus enhancers, and as emotion 
arousers. 


Regulatory Processes 

A comprehensive theory of behavior must explain 
not only how response patterns are required but how 
their expression is regulated and maintained. In so¬ 
cial learning theory, human functioning relies on 
three regulatory processes. They include stimulus, 
cognitive, and reinforcement control. For explanatory 
purposes, these control functions are discussed sepa¬ 
rately, although in reality they are closely inter¬ 
related. 

Stimulus Control 

To function effectively a person must be able to 
anticipate the probable consequences of different 
events and courses of action and regulate his be¬ 
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havior accordingly. Without a capacity for anticipa¬ 
tory or foresightful behavior, man would be forced 
to act blindly in ways that might eventually prove 
to be highly unproductive, if not perilous. Informa¬ 
tion about probable consequences is conveyed by 
environmental stimuli, such as traffic signals, verbal 
communications, pictorial messages, distinctive 
places, persons, or things, or the actions of others. 

In the earliest years of development, environ¬ 
mental stimuli, except those that are inherently 
painful, exert little or no influence on infants and 
young children. As a result of paired experiences, 
either direct or vicarious, formerly neutral stimuli 
begin to acquire motivating and response-directive 
properties. Stimulus control over emotions and ac¬ 
tions is established in both instances through asso¬ 
ciation, but they differ in what gets associated. 

Stimulus Control of Physiological and 

Emotional Responsiveness 

Physiological responses can be most readily brought 
under the control of environmental stimuli by having 
two stimulus events occur contiguously. If a formerly 
neutral stimulus is closely associated with one that 
is capable of eliciting a given physiological response, 
the former stimulus alone gradually acquires the 
power to evoke the physiological response or a frac¬ 
tional component of it. Although some types of auto¬ 
matic responses are more difficult to condition than 
others, almost every form of somatic reaction that 
an organism is capable of making, including changes 
in respiration, heart rate, muscular tension, gastro¬ 
intestinal secretions, vasomotor reactions, and other 
indices of emotional responsiveness, has been classic¬ 
ally conditioned to innocuous stimuli [Kimble 1961]. 
Environmental events can likewise acquire the 
capacity to control cortical activity through associa¬ 
tion with either external evocative stimuli or direct 
brain stimulation [John 1967], Conditioning is by 
no means confined to events in the external envi¬ 
ronment. In recent years researchers [Razran 1961, 
Slucki, Adam, & Porter 1965] have been able to con¬ 
dition physiological reactions as well as defensive 
behaviors to variations in the intensity of visceral 
stimulation from internal organs. 

The conditioning process described above has im¬ 
portant implications for the understanding of, among 
other things, psychosomatic disorders, defensive be¬ 
havior, and evaluative reactions having an affective 
component. In the psychosomatic field Dekker, 
Pelser, and Groen [1957] established asthmatic 
attacks in two patients by pairing formerly ineffective 


stimuli with allergens that evoked respiratory dys¬ 
functions. A careful analysis of the situations ir. 
which other patients regularly experienced asthmatic 
attacks revealed that a varied array of stimuli had 
acquired controlling value; these included, among 
other things, radio speeches by influential politicians, 
children’s choirs, the national anthem, elevators, 
goldfish, caged birds, the smell of perfume, water¬ 
falls,, bicycle races, police vans, and horses. Once the 
conditioned stimuli had been identified in a particu¬ 
lar case, Dekker and Groen [1956] were able to 
induce attacks of asthma by presenting the evocative 
stimuli in actual or pictorial form. 

A great deal of human behavior is under aversive 
stimulus control. In this form of emotional learning, 
persons, places, and events become endowed with 
anxiety-arousing value through association with pain¬ 
ful experiences. A prime function of most anticipa¬ 
tory behavior is to provide protection against poten¬ 
tial threats. Once established, defensive behavior is 
exceedingly difficult to eliminate because it derives 
self-reinforcing power from its capacity to reduce 
distress. Any protective activities that remove or 
avoid discomfort aroused by conditioned threats 
are thereby strengthened even though the fears may 
no longer be realistically justified. This process is 
graphically illustrated by the apocryphal case of a 
compulsive who, when asked by a psychiatrist why 
he incessantly snapped his fingers, replied that it 
kept ferocious lions away. When informed that 
obviously there were no lions in the vicinity, the 
compulsive client replied, "See, it works.” Inhibi¬ 
tions and avoidance responses commonly regarded 
as neurotic (for example, phobias and obsessive- 
compulsive rituals) are similarly strengthened by 
their capacity to lessen subjectively distressing but 
objectively nonexistent threats. To the extent that 
formerly aversive situations are successfully avoided, 
individuals prevent themselves from discovering that 
current conditions of reinforcement may differ sub¬ 
stantially from those in the past, when their anxiety 
reactions were appropriate. 

In an early-study Miller [1948] showed how sub¬ 
jective threats can motivate, and fear reduction re¬ 
inforce, a variety of defensive behaviors in a cur¬ 
rently benign environment. Animals were shocked in 
a white compartment of a shuttle box where they 
quickly learned to escape the painful stimulation by 
running through an open door into a black compart¬ 
ment. The formerly neutral white cues thus acquired 
aversive properties so that the animals continued to 
run when placed in the white box even after the 
shocks had been completely discontinued. To examine 
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further acquisition and self-perpetuation of new de- 

r ive absence of any physical 

danger, Miller placed the animals in the white com¬ 
partment with the door closed to prevent escape. The 
door could be released, however, by rotating a 
wheel. Wheel turning was rapidly learned and main- 
tamed by fear reduction. When environmental con¬ 
ditions were further changed so that wheel turning 
no longer released the door, but the animal could 
escape the compartment by pressing a bar, the former 
response was quickly discarded while the latter be¬ 
came strongly established. An animal psychiatrist 
observing these winded animals dashing through 
compartments, turning wheels, and pressing bars to 
avoid nonexistent shocks would justifiably diagnose 
em as suffering a serious psychological disorder. 
Their behavior was clearly out of touch with reality 
wSf conditioning. Conditioning principles 
would have limited explanatoiy power if emotional 
responses could be established only through direct 
physical experiences. It is not uncommon for people 
o display strong emotional reactions toward certain 
mgs or classes of people on the basis of little or 
no personal contact with them. Such responses are 
frequency developed on the basis of higher processes 
hlcb symbolic stimuli that have acquired positive 
or negative valence through direct association with 

primary experiences serve as the basis for further 
conditioning. 

Emotion-arousing words often function as the ve¬ 
hicle for symbolic conditioning. Words that conjure 

up feelings of revulsion and dread can be effectively 
used to create new fears and hatreds; ^ y 

words arousing positive emotions can endow associa- 

invech Ven i tS ^ plcaSing 1 ualities - I" laboratory 
' 1 10 e S ° f s >' lnbolic conditioning neutral sylla¬ 

bles have been found to take on negative value 
trough repetitive pairing with adjectives having 
negative connotations (for example, ugly, dirty) 

SeTXv h Se T 6 it6mS ar£ evaluat «d as pleasant 
HiH A ^ 3 j 6 been associated with positively con- 
iboned words such as beautiful and happy [Insko 

evahfaH 5 ’ & St3atS 1957] ' ^existing 

son ^ rea ? S t0Ward famiJiar names of per 

Sroirnl TaT a ' S0 been “gently altered 
through conditioning methods using emotional words 

^ evocahve stimuli [Stoats & Staats 1958], Moreover 

that e en f JaC n bSO u fI9?0] provide evidence 

that emotionally charged words can condition auto- 

nomic responses as well as evaluative reactions to 

toward St f nul1 ' EvaluaUve responses occur not only 
toward objects singled out for conditioning bit 

they also tend to generalize along establisheifasso- 


ciative networks, thus resulting in widespread effects 
Das and Nanda [1963], after associating neutral 
syllables with the names of two aboriginal tribes 
conditioned positive and negative reactions to the 
syllables by pairing them with affective words. The 
tribes took on positive and negative values in ac¬ 
cordance with the evaluative responses developed to 
the syllables with which they had been associated. 

Symbolic conditioning may be achieved to some 
extent through pictorial stimuli having affective 
properties. Geer [1968], for example, conditioned 
autonomic responses to tones with photographs of 
victims of violent death. The role of conditioning 
processes is perhaps nowhere more dramatically illus¬ 
trated than in the marked cross-cultural variations in 
the physical attributes and adornments that become 
sexual arousers. What is invested with arousal prop¬ 
erties in one society-corpulence or skinniness, up¬ 
right hemispherical breasts or long pendulous ones 
shmy white teeth or black pointed ones, distorted 
ears, noses, or lips, broad pelvis and wide hips or 
narrow pelvis and slim hips, light or dark skin color 
may be neutral or repulsive to members of another 
social group. A bold experiment by Rachman [19661 
on how fetishes might be acquired throws some light 
on symbolic conditioning of sexual arousal. After a 
photograph of women's boots was repeatedly associ¬ 
ated with slides of sexually stimulating nude females, 
males exhibited sexual arousal (as measured by 
penile volume change) to the boots alone and 
generalized the conditioned sexual responses to other 
types of black shoes. Needless to say, these unusual 
sexual reactions were thoroughly eliminated at the 
conclusion of the study. Consistent with the latter 
findings, McGuire, Carlisle, and Young [1965] ad¬ 
vance the view that deviant sexuality often develops 
through masturbatory conditioning in which aberrant 
sexual fantasies are endowed with strong erotic value 
through repeated association with pleasurable experi¬ 
ences from masturbation. 

Vicarious conditioning. While many emotional 
responses are learned on the basis of direct experi- 
ence, much human learning undoubtedly occurs 
through vicarious conditioning. The emotional re¬ 
sponses of another person, as conveyed through 
vocal, facial, and postural manifestations, can arouse 
strong emotional reactions in observers. Affective 
social cues most likely acquire arousal value as a 
result of correlated experiences between people. 
That is individuals who are in high spirits tend to 
eat others in amiable ways, which arouse in them 
similar pleasurable affects; conversely, when indi¬ 
viduals are dejected, ailing, distressed, or angry, 
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others are also likely to suffer in one way or another. 
This speculation receives some support in a study 
by Church [1959], who found that expression of pain 
by an animal evoked strong emotional arousal in 
animals that had suffered pain together; it had much 
less emotional effect on animals that had undergone 
equally painful experiences hut unassociated with 
suffering of another member of their species, and it 
left unmoved animals that were never subjected to 
any distress. 

In vicarious conditioning, events take on evocative 
properties through association with emotions aroused 
in observers by affective experiences of others. In 
laboratory studies of this phenomenon, an observer 
hears a tone and shortly thereafter he sees another 
person exhibit pain reactions (actually feigned) as 
though he were severely shocked [Berger 1962]. 
Observers who repeatedly witness this sequence of 
events begin to show emotional responses to the 
tone alone even though no pain is ever inflicted on 
them. In everyday life, of course, pain may be wit¬ 
nessed from a variety of sources. Observation of 
failure experiences and the sight of terrified people 
threatened by menacing animals have, for instance, 
served as arousers for emotional learning [Bandura, 
Blanchard, & Ritter 1969, Craig & Weinstein 1965]. 

Despite the importance of vicarious learning, there 
has been surprisingly little study of the factors de¬ 
termining how strongly people can be affectively 
conditioned through the experiences of others. The 
nature of the relationship between the observer and 
the sufferer is undoubtedly an influential factor. 
People are generally less affected emotionally by the 
adversities of strangers than by the suffering and joy 
of those close to them and on whom they depend. 
Observers’ sensitivity to expressions of suffering, 
derived from their past social experiences, may be 
another contributor. Bandura and Rosenthal [1966], 
for example, found that the degree to which observ¬ 
ers were emotionally aroused affected their level of 
vicarious conditioning. Those who were under mod¬ 
erate emotional arousal displayed the highest rate 
and most enduring conditioned autonomic responses, 
whereas those who were either quite calm or highly 
aroused showed the weakest vicarious conditioning. 
Apparently, anguished reactions proved so upsetting 
to observers who themselves were beset by high 
arousal that they diverted their attention from the 
suffering person and sought refuge in distracting 
thoughts of a calming nature. 

It is evident from the preceding discussion that 
emotional learning is much more complex than is 
commonly assumed. Emotional responses can be 
brought under the control of intricate combinations 


of internal and external stimuli that may be either 
closely related to, or temporally remote from, physical 
experiences. The fact that stimulus events can be en¬ 
dowed with emotion-arousing potential on a vicarious 
basis further adds to the complexity of conditioning 
processes. Additionally, it will be shown later that for¬ 
merly neutral stimuli can acquire evocation properties 
through association with thought-produced arousal. 
Once conditioned stimuli have acquired eliciting 
power, this capacity transfers to other sets of stimuli 
that possess similar physical characteristics, to seman¬ 
tically related cues, and even to highly dissimilar 
stimuli that happen to be associated in peoples 
experiences. 

By associating the term behaviorimi with odious 
images of salivating dogs and animals driven by 
carrots and sticks, critics of behavioral approaches 
skillfully employ Pavlovian conditioning procedures 
on their receptive audiences to endow this point of 
view with degrading properties. The fact that valua¬ 
tion of places, persons, and things is affected by 
one's emotional experiences, whether they be fear¬ 
ful, humiliating, mournful, or pleasurable, does not 
mean that such conditioning outcomes reflect a base 
animal process. To expect people to remain unaffect¬ 
ed by paired experiences is to require that they be 
less than human. Moreover, to be sensitive to the 
consequences of one's actions indicates intelligence 
rather than subhuman functioning. 

Cognitive Control of Conditioning Phenomena 

Both popular accounts and psychological descrip¬ 
tions of conditioning phenomena convey the impres¬ 
sion that emotional responsiveness is conditioned 
automatically through paired stimulation occurring 
in a certain temporal relationship. This view is 
typically reinforced by diagrammatic portrayal of the 
process as a nonmediated one in which conditioned 
stimuli are directly connected to responses evoked by 
unconditioned stimuli. Results of several lines of 
research indicate that, in humans, conditioning 
phenomena, cannot be fully understood without en¬ 
compassing the influence of cognitive control. 

Studies of the relationship between awareness of 
paired events and degree of learning reveal that con¬ 
ditioning involves more than simply linking stimuli 
to new responses through contiguous association. 
Fuhrer and Baer [1965] report an experiment in 
which one tone was always followed by shock but a 
different tone was presented alone. People who 
recognized that one of the tones signified shock re¬ 
sponded emotionally whenever it appeared, whereas 
those who remained unaware of the stimulus con- 
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tingeneies did not. Other researchers have likewise 
shown that repetitive association of neutral and 
aversive stimuli does not produce conditioned emo¬ 
tional responses in people who fail to recognize that 
the two sets of events are related [Dawson & Grings 
1968, Dawson & Satterfield 1969]. 

That awareness is a determinant of conditioning 
rather than vice versa is convincingly demonstrated 
by Chatterjee and Eriksen [1962]. People who were 
informed that shock would follow a particular word 
in a chain of associations quickly developed condi¬ 
tioned heart-rate responses. In contrast, subjects who 
were led to believe that the occurrence of shock was 
not related in any consistent way to their verbaliza¬ 
tions evidenced no autonomic conditioning even 
though they repeatedly experienced the same paired 
stimulation as their aware counterparts. 

The most striking evidence of cognitive control of 
conditioned responses is provided by studies com¬ 
paring extinction of emotional reactions to condi¬ 
tioned stimuli in subjects who are informed that the 
stimuli are no longer followed by painful events and 
in those who are never told that the threat has ceased 
to exist. Induced awareness promptly abolishes con¬ 
ditioned autonomic responses [Grings & Lockhart 
1963, Notterman, Schoenfeld, & Bersh 1952, Wickens, 
Allen, & Hill 1963] and avoidance behavior [Lindley 
& Moyer 1961, Moyer & Lindley 1962], whereas un¬ 
informed subjects lose their fear only gradually. 

Self-arousal interpretation of conditioning. Ac¬ 
cording to social learning theory, conditioned emo¬ 
tional responses are typically mediated through 
thought-produced arousal rather than being directly 
evoked by conditioned stimuli. The power to arouse 
emotional responses is by no means confined to 
external events. People can easily make themselves 
nauseated by imagining revolting experiences. They 
can become sexually aroused by generating erotic 
fantasies. They can frighten themselves by fear- 
provoking thoughts. And they can work themselves 
up into a state of anger by ruminating about mis¬ 
treatment from offensive provocateurs. Indeed, Barber 
and Hahn [1964] found that imagined painful stimu¬ 
lation produced subjective discomfort and physio¬ 
logical responses similar to those induced by the 
actual painful stimulation. The incomparable Satchel 
Paige, whose extended baseball career provided 
many opportunities for anxious self-arousal, color¬ 
fully described the power that thoughts can exert 
over viscera] functioning when he advised, “If your 
stomach disputes you lie down and pacify it with 
cool thoughts.” 

The findings reported in the preceding section are 
consistent with the self-arousal interpretation of con¬ 


ditioning. In aware individuals, events that forebode 
distress activate fear-arousing thoughts that produce 
emotional responses. On the other hand, those who, 
for one reason or another, fail to notice that the con¬ 
ditioned stimulus foreshadows pain do not conjure 
up frightening ideas. As a result, the conditioned 
stimulus rarely evokes emotional responses even 
though it is often paired with unpleasant experiences. 
Sudden disappearance of conditioned emotional re¬ 
sponses following awareness that the threat has been 
removed is also explainable in terms of self-arousal 
processes. Given such knowledge, conditioned stim¬ 
uli no longer activate frightening thoughts, and a 
major source of emotional responses is thus removed. 

It follows from self-arousal theory that emotional 
conditioning can be achieved on a cognitive basis in 
the absence of physically painful experiences. Grings 
and others [Bridger & Mandel 1964, Dawson & 
Grings 1968] provide some evidence on this point. 
In their experiments, subjects are told that a given 
stimulus will sometimes be followed by shock, but 
except for a sample experience, this in fact never 
occurs. Formerly neutral stimuli acquire arousal 
capabilities through association with thought-pro¬ 
duced emotional responses. That covert self-stimula¬ 
tion plays an influential role in this type of 
cognitively based conditioning is shown by Dawson 
[1966]. He reports that the level of emotional con¬ 
ditioning was positively related both to the degree 
to which subjects believed that they would be 
shocked and to the severity of pain they anticipated. 

Although the development and extinction of con¬ 
ditioned emotional responses are subject to cognitive 
control, this does not mean that all conditioned re¬ 
sponses are necessarily consciously mediated. The 
degree of cognitive control partly depends on the 
conditions under which the emotional responses are 
originally acquired. Bridger and Mandel [1964] 
found that autonomic conditioning was similar re¬ 
gardless of whether stimuli were associated with 
threat of shock alone or with threat combined with 
shock experiences. Emotional responses established 
on the basis of actual painful experiences, however, 
were less susceptible to cognitive control. Thought- 
induced conditioned responses promptly disappeared 
with the knowledge that shocks would no longer be 
forthcoming. By contrast, conditioned emotional 
responses originating in painful experiences persisted 
for some time despite awareness that the physical 
threat was completely removed. 

These findings may be explained in several ways. 

It may mean that conditioned responses contain dual 
components, as Bridger and Mandel suggest. One of 
the components—created by self-arousal influences— 
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is readily modifiable by having a person alter his 
thoughts. The second component may be a non- 
mediated one that is directly evoked by external 
stimuli and hence requires disconfirming experiences 
for its extinction. Snake phobics, for example, will 
instantaneously respond with fear at the sight of a 
coiled snake before they have time to cogitate about 
the potential dangers of reptiles. 

An alternative interpretation is that in instances 
where people either have undergone distressing ex¬ 
periences or there is a remote chance that they might 
get hurt, external stimuli become such powerful elici- 
tois of fear-provoking thoughts that they are not 
easily subject to voluntary control. To tell phobics 
who dread heights that they can gaze down safely 
from the rooftop of a tall building because of protec¬ 
tive railings does not mean that they will be able to 
turn off thoughts about the horrendous things that 
could conceivably happen. Here fearfulness is still 
cognitively mediated but the individual is unable to 
control his thoughts, however safe the eliciting situa¬ 
tions may be. 

Nonmediational theories of conditioning assume 
that associated stimulus events must be registered in 
the nervous system of the organism. Studies exam¬ 
ining the influence of awareness on conditioning 
should therefore obtain evidence that the input from 
the stimulus to be conditioned has in fact been re¬ 
ceived. It is not inconceivable that in experiments 
where, to reduce awareness, subjects’ attention is 
diverted to irrelevant features of the situation, the 
conditioned stimuli may not be registered in a 
sufficiently consistent manner to produce stable con¬ 
ditioned responses. Neural responses to afferent input 
can be substantially reduced by focusing attention 
on irrelevant events. In neurophysiological studies 
by Hemandez-Peon, Scherrer, and Jouvet [1956], for 
instance, auditory neural responses to a loud sound 
were virtually eliminated in cats when they gazed 
at mice, attentively sniffed fish odors, or received 
shocks that disrupted their attentiveness. Horn 
[1960] noted a similar weakening of neural responses 
to a light flash during active attention to other sights 
and sounds. 

People who direct their attention to extraneous 
features may neither experience nor recognize the 
conditioned stimulus. Absence of conditioning, under 
such circumstances, may be erroneously attributed to 
lack of conscious recognition when, in fact, it derives 
from deficient sensory registration of the stimulus. 
Proof that awareness is necessary for learning would 
require evidence that, despite adequate neural regis¬ 
tration of the paired environmental events, classical 


conditioning does not occur unless the relationship 
between the events is recognized. 

Developmental theories of psychological function¬ 
ing often draw sharp distinctions between associa¬ 
tive and cognitive processes as though they repre¬ 
sented independent functions. As noted above, 
cognitive influences can markedly affect perform¬ 
ances regarded as associative in nature. Extent of 
cognitive control can, in turn, be influenced by asso¬ 
ciative factors. Mandel and Bridger [1967], for in¬ 
stance, demonstrated that awareness hastened fear 
extinction, but the speed with which aware subjects 
lost their apprehensions depended on how closely 
and in what order neutral and aversive stimuli were 
associated when the fear was first learned. Most 
changes in behavior undoubtedly result from the 
interactive effects of associative and cognitive influ¬ 
ences. 

Disorders Arising from Inappropriate 
Stimulus Control 

The development of stimulus functions has con¬ 
siderable adaptive value, but unfortunatelv, as 
alluded to in some of the illustrative material, it can 
also create needless distress and constricting defen¬ 
siveness. Dysfunctions of this sort can arise in several 
different ways. Events that happen to occur in the 
context of traumatic experiences but are in no way 
causally related to them sometimes take on aversive 
properties and produce inappropriate generalization 
of anxiety reactions. The following letter taken from 
the advice column of a newspaper illustrates such 
inappropriate generalization. 

Dear Abby: 

My friend fixed me up with a blind date and I should 
have known the minute he showed up in a bow tie 
that he couldn't be trusted. I fell for him like a rock. 
He got me to love him on purpose and then lied to 
me and cheated on me. Every time I go with a man 
who wears a bow tie, the same thine happens. I think 
girls should be warned about men who wear them. 

Against Bow Ties 

In this above example, the letter writer had gene¬ 
ralized a whole pattern of social behavior to bow 
ties, a stimulus one would not expect to be routinely 
correlated with deceitfulness. To the extent that her 
anticipatory hostile actions evoke negative counter¬ 
reactions in bow-tied men, her defensive behaviors 
create adverse experiences and are thereby self- 
perpetuated. Here the inappropriate behavior is 
maintained by a self-produced reality rather than 
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one that had been in effect in the past but no longer 
exists under changed conditions of life. 

Irrational defensive behaviors often occur on the 
basis of overgeneralization from events associated 
with traumatic experiences to others that are similar 
to them either physically or semantically. In the 
often quoted study by Watson and Rayner [1920], 
for example, several pairings of a rat and a loud 
sound produced in a young boy, not only marked 
fear of the rat, but the fear generalized widely to 
other furry objects including rabbits, dogs, fur coats, 
cotton, wool, and even human hair. 

Clinical evidence exists that suggests that relatively 
innocuous stimuli may be invested with powerful 
aversive properties through semantic generalization. 
Walton and Mather [1963] report the case of a wo¬ 
man who suffered from obsessions about being dirty 
and spent much of her life performing incapacitating 
ygienic rituals. 

The main feature was compulsive handwashing which 
appeared to arise from doubts about contamination of 
herself by dirt. After every daily activity or whenever 
she believed she may have touched something which 
had been handled by another person before ner, she 
would wash her hands four or five times (taking fifteen 
minutes). Door knobs and taps were particularly 
anxiety-provoking as they were handled most fre¬ 
quently. Going to the toilet would always be followed 
by handwashing and additional scrubbing of the nails. 
Taking a bath and washing her hair would occupy sev¬ 
eral hours due to continual rewashing of herself, the 
bath, and the washbasin. To avoid contamination, she 
would ensure that she never brushed against walls or 
other people’s clothing and she always kept her own 
clothes in a special place untouched by others. Any 
street or thoroughfare would be examined for patches 
of dirt. Many such activities, such as turning on taps 
and opening doors were delegated to her mother. She 
would in fact never venture outside the house alone 
lest she were required to handle something and there 
were no facilities for washing. Sitting on public seats 
was also difficult for her [p. 169 ]. 

The obsessive-compulsive behavior began with her 
severe guilt and feelings of “dirtiness” because of 
sexual relations in a love affair with a married man. 
Eventually, a wide range of stimuli related to uro¬ 
genital activities and all forms of dirt became dis¬ 
turbing to her. 

Maladaptive emotional responses can not only be 
created by unpleasant paired experiences, but they 
can be eliminated by therapeutic methods utilizing 
similar principles [Bandura 1969a, Franks 1969]. 
This is achieved by repeated exposure to threatening 
events without the occurrence of any adverse con¬ 
sequences. Several variant procedures have been de¬ 
vised and applied with relatively high success. In the 


extinction method disabling anxieties are removed 
by gradually exposing clients to progressively more 
frightening situations or through massive contact 
with what they fear. Some understandably loathe 
facing what they dread, thus requiring positive in- 



desensitize anxious people by presenting to them the 
things- they fear together with anxiety neutralizers 
(for example, muscular relaxation, tranquilizing im¬ 
agery, or the reassurance and security of a supportive 
person) capable of eliciting highly positive reactions. 
By reducing the level of anxiety arousal, positive 
activities allow persons to tolerate threats they might 
ordinarily avoid without experiencing them as un¬ 
duly aversive. A third major approach relies upon 
modeling procedures wherein anxiety disorders are 
eliminated by having anxious clients observe others 
engaging in threatening activities without experi¬ 
encing any untoward consequences. After undergo¬ 
ing this form of treatment they lose their fears, they 
can engage in activities they formerly inhibited, and 
they develop more favorable attitudes toward the 
things they abhorred. 

Stimulus Control of Action 

The same actions can produce markedly different 
consequences, depending upon the time, the place, 
and the persons toward whom they are expressed. 
Driving through a busy intersection on a red signal, 
for example, will have painfully different effects from 
crossing on a green light. People therefore come to 
attend closely to stimuli that predict reinforcement 
and to ignore those that do not, and they utilize cues 
that signify probable consequences in regulating 
their behavior. Stimuli indicating that given actions 
will be punished or nonrewarded tend to inhibit their 
performance, whereas those signifying that the ac¬ 
tions are permissible or rewardable facilitate their 
occurrence. The capacity to regulate responsiveness 
on the basis of information conveyed by antecedent 
stimuli about likely response consequences provides 
the mechanism for foresightful behavior. 

Stimuli acquire controlling power by being corre¬ 
lated with differential response consequences. Tra¬ 
ditional explanations of how stimulus control is de¬ 
veloped focus primarily on direct training in which 
responses are reinforced only in the presence of cer¬ 
tain cues but never in a different stimulus context. 
Stimulus control is undoubtedly established and 
maintained in many instances through correlation 
with experienced response consequences. However, 
man's symbolic capacity enables him to gain such 
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information without having to perform responses 
and experience them rewarded, ignored, or punished 
under varied conditions. Much learning of this sort 
is achieved through verbal explanations that describe 
situations in which certain actions are regarded as 
appropriate or out of place. Moreover, as we shall 
see later, stimulus control can be effectively estab¬ 
lished by observing how the behavior of others is 
reinforced in different situations. Although people 
often guide their actions solely on the basis of what 
they have observed or been told, maintenance of 
verbally and vicariously produced stimulus control 
ordinarily requires periodic confirmation through 
direct experience. 

Interpersonal behavior is partly regulated by char¬ 
acteristics of people that predict the consequences 
likely to accompany certain courses of action. Chil¬ 
dren often behave quite differently in the presence 
of their mothers and their fathers in accordance with 
their disciplinary practices. The following quotation 
provides a telling example of an autistic boy who 
free y expressed destructive behavior with his lenient 
mother but rarely did so in the presence of his father, 
who tolerated no aggression. 


V\ henever her husband was home. Billy was a model 
youngster. He knew that his father would punish him 
quickly and dispassionately for misbehaving. But when 
his father left the house, Billy would go to the window 
and watch until the car pulled out. As soon as it did, 
he was suddenly transformed. . . . "He'd go into mv 
closet and tear up my evening dresses and urinate oh 
my clothes He d smash furniture and run around bit- 
mg the walls until the house was destruction from one 
end to the other He knew that I liked to dress him in 
nice clothes, so he used to rip the buttons off his shirts 
and used to go in his pants [Moser 1965, p. 96]. 

In a formal study of social stimulus control Redd 
and Bimbrauer [1969] had one adult reward a group 
O seclusive children for playing cooperatively, while 
a second aduU rewarded them equally regardless of 
them behavior. Later the mere appearance of the 
contingently rewarding adult evoked cooperative 
play, but the noncontingent adult exerted no influ¬ 
ence on the children's behavior. When the adults re- 
versed their reinforcement practices their power to 
elicit play behavior changed accordingly. 

People frequently regulate their behavior on the 
basis of more subtle social cues. To take a common 
example parents are quick to issue commands to 
their children, but they do not always see to it that 
their requests are heeded. Children are therefore in¬ 
clined to ignore demands voiced in mild or moderate 
tones. The parents' mounting anger usually serves as 
tiie cue that they will enforce compliance so that 


only shouts produce results. Indeed, many house¬ 
holds are run at a fairly high decibel level. 

Of the numerous stimuli that influence how people 
will behave at any given moment, none is more 
ubiquitous or effective than the actions of others. 
People applaud when others clap, they laugh when 
others laugh, they exit from social functions when 
they see others leaving, and on countless other occa¬ 
sions their behavior is prompted and channeled by 
modeling stimuli. J 

The actions of others acquire controlling proper¬ 
ties through selective reinforcement in much the 
same way as do physical and symbolic stimuli in 
nonsocial forms. When behaving like others pro¬ 
duces rewarding outcomes, modeling cues become 
powerful determinants of analogous behavior; con¬ 
versely, when matching actions are treated nega¬ 
tively but dissimilar behavior proves rewarding, 
models’ responses prompt divergent performances in 
observers. Because people usually display modes of 
behavior that are appropriate and effective, follow¬ 
ing good examples is much more reinforcing than 
tedious trial and error. As a result, modeling cues 
generally assume high predictive value. Thus, by 
relying on the actions of knowledgeable models, a 
novice can behave appropriately in synagogues, in 
mosques, in saloons, at wedding ceremonies, and in 
countless other situations, without having to discover 
the acceptable conduct through shocked or pleased 
reactions to his unguided performances. The dictum 
When in Rome do as the Romans do" underscores 
the functional value of modeling stimulus control. 

People differ in the degree to which their behavior 
is guided by modeling influences, and not all models 
are equally effective in eliciting the types of behavior 
they themselves exemplify. Responsiveness to model¬ 
ing influences is largely determined by tliree factors, 
which derive their power from correlation with con¬ 
ditions of reinforcement. These include the charac¬ 
teristics of the models, the attributes of the observer, 
and the response consequences associated with match¬ 
ing behavior. 

With regard to model characteristics, those who 
have high status, prestige, and power are much more 
effective in evoking matching behavior in observers 
than models of low standing. The force of prestigeful 
example is well illustrated in a study of behavioral 
contagion conducted by Lippitt, Polansky, and Rosen 
[1952]. Children in summer camps rated each other 
in terms of their power to influence others. Observers 
then recorded the incidence and sources of behav¬ 
ioral contagion, defined as spontaneous imitation of 
the actions of another person where he displayed no 


18 



Social Learning Theory 


intent to get others to follow his example. The be¬ 
havior of a few models who were attributed high 
power largely determined the conduct of the camp 
members. The influence of prestigeful modeling is 
demonstrated even more convincingly by Lefkowitz, 
Blake, and Mouton [1955]. Pedestrians were more 
likely to cross a street on a red light when they saw 
a presumably high-status person in executive attire 
do it than when the same transgression was per¬ 
formed by the same model dressed in soiled patched 
trousers, scuffed shoes, and a blue denim shirt. 

It is not difficult to explain why model status 
facilitates matching behavior. The actions of models 
who have gained some status are more likely to be 
successful and hence have greater functional value 
for observers than the behavior of models who pos¬ 
sess relatively low vocational, intellectual, and 
social competencies. In situations where people are 
uncertain about the wisdom of modeled courses of 
action, they rely on model characteristics and status- 
conferring symbols (for example, speech, dress, de¬ 
portment, possession of material goods), which 
serve as tangible indicants of past successes. The 
effects of a model's prestige tend to generalize from 
one area of behavior to another, as when prominent 
athletes advise on breakfast cereals as though they 
were nutrition experts. Unfamiliar persons also gain 
influence by their similarity to models whose behav¬ 
ior proved successful in the past. 

Some efforts have been made to identify the types 
of people who are most responsive to modeling in¬ 
fluences. Observers who achieve better outcomes by 
following the examples of others than through their 
own independent behavior are readily susceptible to 
modeling influences. Therefore, those who have been 
frequently rewarded for imitative behavior and those 
who lack self-esteem, who feel incompetent, and who 
are highly dependent are especially prone to pattern 
their behavior after successful models [Bandura 
1969b, Campbell 196]]. 

The preceding generalizations must be accepted 
with reservations because the functional value of 
modeled behavior overrides the influence of either 
model or observer characteristics. It is exceedingly 
unlikely that dull, anxious, and dependent students 
would gain more from observing skilled ski instruc¬ 
tors, brain surgeons, or airline pilots than understud¬ 
ies who are bright, attentive, and self-assured. When 
modeled activities are highly valued, the more ven¬ 
turesome and talented observers benefit most from 
exposure to exemplary models. Laboratory studies 
reporting prosaic correlates of imitativeness typically 
employ inconsequential modeled behaviors that com¬ 


petent observers would have little incentive to learn. 

The attributes of models exert greatest influence 
when it is unclear what consequences their behavior 
is likely to have. Observers must therefore rely on 
status cues that may have been correlated with rein¬ 
forcement in their past experiences. A prestigeful or 
attractive model may induce a person to try a given 
course of action, but if the behavior should prove 
unsatisfactory, it will be discarded and the model’s 
future influence diminished. For this reason studies 
in which response consequences are not evident may 
exaggerate the role played by model characteristics 
in the long-term control of social behavior. 

In everyday life cues that predict the likely con¬ 
sequences of different actions appear as part of a 
bewildering variety of irrelevant events. To compli¬ 
cate matters further, the rules governing reinforce¬ 
ment often involve particular combinations of sev¬ 
eral environmental factors. To borrow an example 
from the study of conceptual behavior, a person is 
asked to sort a set of pictures that differ in a variety 
of ways into correct and incorrect piles without being 
told the relevant feature. Let us arbitrarily designate 
all pictures containing an adult drinking an alcoholic 
beverage as positive instances of the concept, and 
those that do not as negative instances. As the per¬ 
son sorts the pictures on the basis of provisional 
guesses, he observes which of his placements get re¬ 
warded and he eventually abstracts the common fea¬ 
ture from the array of irrelevant stimuli. Response 
consequences are generally determined by the joint 
presence of several factors. Continuing with the 
above example, only pictures depicting drinking by 
an adult at home or in a bar, in the evening, and in 
the company of others might be considered appro¬ 
priate, whereas solitary or daytime drinking and 
imbibing in work settings would not. Here the rein¬ 
forcement rule combines temporal, social, and situ¬ 
ational cues. The important role played by cognitive 
processes in rule-regulated behavior is discussed later. 

Defective stimulus control. Effective social func¬ 
tioning requires highly discriminative responsiveness, 
often to subtle variations in stimulus events. Some 
behavior disorders primarily reflect defective stimu¬ 
lus control, due either to faulty reinforcement prac¬ 
tices or to a loss of such functions under stressful 
experiences. Rosenbaum [1953, 1956], for example, 
found that people were less able to distinguish criti¬ 
cal from irrelevant features of the environment under 
strong than mild threat, and those prone to emotional 
arousal were most adversely affected in this regard. 

Social behavior is extensively regulated by verbal 
cues. We influence people's actions in innumerable 
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situations by suggestions, requests, commands, and 
wntten directives. Because of the importance of sym¬ 
bolic communication in human relationships, defi¬ 
cient or inappropriate responsiveness to verbal stim¬ 
uli can have serious consequences. As part of a pro¬ 
gram to develop procedures for modifying psychotic 
behavior, Ayllon and his associates [Ayllon & Haugh- 
ton 1962, Ayllon & Michael 1959] provide illustra¬ 
tions of defective verbal control. In one study, a 
group of schizophrenics with severe chronic eating 
problems were totally unresponsive to meal an¬ 
nouncements or to persuasive appeals. Because of 
concern for their health, patients were escorted by 
nurses to the dining room, spoon-fed, tube-fed, and 
subjected to electroshock “therapy" and other forms 
of infantilizing and punitive treatments. It appeared 
that the nurses' coaxing, persuading, and feeding 
inadvertently reinforced the eating problems. Also, 
by rewarding nonresponsiveness to verbal requests, 
they lost their directive function. All social rewards 
for ignoring the announcement of mealtime and for 
refusals to eat were therefore withdrawn; following 
meal call, the dining room remained open for thirty 
minutes and any patient who failed to appear during 
that time simply missed his meal. After these conse¬ 
quences were instituted, patients responded in a 
socially appropriate manner to meal calls and fed 
themselves. 

In his studies of language learning in autistic chil¬ 
dren, Lovaas [1967] shows how the influence of 
modeling stimuli can be negated by faulty reinforce¬ 
ment practices. Autistic children who lacked com¬ 
municative speech imitated the therapists verbali¬ 
zations with high accuracy when rewards were made 
contingent upon correct speech reproductions. Bv 
contrast, when children were equally generously re¬ 
warded but without regard to the quality of their 
verbalization, their imitative behavior progressively 
deteriorated until it bore little resemblance to the 
verbal responses modeled for them by the therapist. 

Reinforcement Control 

An organism that responded foresightedly on the 
basis of informative environmental cues but re¬ 
mained unaffected by the results of its actions would 
be too obtuse to survive for long. In fact, behavior is 
extensively controlled by its consequences. Responses 
that cause unrewarding or punishing effects tend to 
be discarded, whereas those that produce rewarding 
outcomes are retained and strengthened. Human be¬ 
havior therefore cannot be fully understood without 
examining the powerful influence of reinforcement 
control. 


Traditional theories of reinforcement have been 
almost entirely concerned with demonstration of how 
behavior can be regulated by directly experienced 
consqeuences arising from external sources. Out of 
this circumscribed research interest grew the un¬ 
fortunate impression that behavior theories view 
man as a manipulable automaton with hardly any 
self-regulatory capacities. Social learning theory, 
while acknowledging the important responsive guid¬ 
ing role played by extrinsic feedback, posits a wider 
range of reinforcement influences. People are not 
only affected by the experiences created by their ac¬ 
tions; they also regulate their behavior to some ex¬ 
tent on the basis of observed consequences, as well 
as those they create for themselves. These three dif¬ 
ferent forms of reinforcement control—direct, vicar¬ 
ious, and self-monitored—are considered next. 

External Reinforcement 

Some of the most impressive demonstrations of 
how behavior is controlled by its immediate conse¬ 
quences are found in the treatment of behavior dis¬ 
orders. Such studies usually employ an “intrasubject 
replication design" in which troublesome behavior is 
successively eliminated and reinstated by systematic 
variation of reinforcement contingencies. One case, 
selected from a large number reported by Harris, 
Wolf, and Baer [1964], illustrates the four-step pro¬ 
cedure. 

First, the person who is having difficulties is ob¬ 
served for a time to assess the incidence of the devi¬ 
ant behavior, the contexts in which it tends to occur, 
and the reactions it elicits from others. In the case 
under discussion, an extremely withdrawn boy spent 
about 80 per cent of his time secluded in isolated 
areas of the nursery school. Observation revealed 
that the teachers unwittingly reinforced his seclu- 
siveness by paying a great deal of attention to him 
when he remained by himself, reflecting his feelings 
of loneliness, consoling him, and urging him to play 
with others. On the infrequent occasions when he 
happened to join other children, the teachers took 
no special notice. 

In the second phase of the program a new set of 
reinforcement practices is instituted. Continuing 
with the above example, the teachers stopped re¬ 
warding solitary play with attention and support. 
Instead, whenever the boy sought out other children, 
a teacher joined the group and gave it her full atten¬ 
tion. In a short time, the boy’s withdrawal declined 
markedly and he was spending about 60 per cent of 
his time playing with other children. 

After the desired changes have been produced, the 
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original reinforcement practices are reinstated to de¬ 
termine whether the deviant behavior was in fact 
maintained by its social consequences. In this third 
stage, for example, the teachers behaved in their 
customary way, being inattentive to his sociability 
but responding with comforting ministrations when 
he was alone. The effect of this traditional “mental 
hygiene approach was to drive the child back into 
seclusiveness. Clearly, social practices should be 
evaluated in terms of the effects they have on recipi¬ 
ents rather than in terms of the humanitarian intent 
of the practitioners. 

In the final phase, the therapeutic contingencies 
are reintroduced, the deviant patterns are eliminated, 
and the adaptive ones are generously rewarded until 
they then are adequately supported by their natural 
consequences. In the above case, the teachers grad¬ 
ually reduced their rewarding attentiveness as the 
boy derived increasing enjoyment from play activi¬ 
ties with his peers. In follow-up observations he con¬ 
tinued to enjoy his social relationships, which con¬ 
trasted conspicuously with his previous seclusiveness. 

Similar treatments conducted with both children 
and adults reveal that a wide variety of grossly devi¬ 
ant behaviors—including self-injurious actions, hypo¬ 
chondriacal and delusional preoccupations, infantile 
and regressive patterns, extreme withdrawal, chronic 
anorexia, psychogenic seizures, asthmatic attacks, 
psychotic tendencies, and countless other disorders 
of long standing—have been successfully eliminated, 
reinstated, and removed for a second time by alter¬ 
ing the amount of social reinforcement they elicit 
from others. 

The regulatory influence of reinforcement has been 
demonstrated not only with deviant conditions but 
with all other forms of behavioral functioning as 
well. Until recently it was widely believed that inter¬ 
nal physiological states could be externally aroused 
but were not subject to reinforcement control. In an 
ingenious series of studies the eminent psychologist 
Neal Miller [1969] succeeded in altering visceral 
responsiveness in animals, including changes in heart 
rate, blood pressure, blood flow, intestinal contrac¬ 
tions, and even rate of urine formation by reinforcing 
the animals whenever physiological responses of a 
selected rate or magnitude occurred. The preciseness 
of reinforcement control is most impressively re¬ 
vealed in a study in which animals were rewarded 
for relatively greater volume flow of blood to one ear 
than to the other. The differential reinforcement pro¬ 
duced corresponding changes in the amount of blood 
flow to the two ears. 

Preliminary studies indicate that this line of re¬ 
search holds promise for advancing understanding of 
the development of psychosomatic disorders and 


their treatment. Voluntary control of one's own phys¬ 
iological functioning is achieved through instantane¬ 
ous feedback techniques. In this procedure, people 
are told to maintain a given level of physiological re¬ 
sponsiveness during which they are signaled with a 
tone or light whenever they achieve a desired state 
of body function. Dworkin and Miller [Miller, 
Di Cara, Solomon, Weiss, & Dworkin 1970] found 
that a subject could increase and decrease his dia¬ 
stolic blood pressure to some degree by informative 
feedback of changes in selected directions. In a fas¬ 
cinating project, Kamiya [1968] succeeded in teach¬ 
ing people to control their alpha brain states (exper¬ 
ienced as feelings of serene well-being) by monitor¬ 
ing their brain waves and sounding a tone when they 
reduced their brain waves to a rhythm between eight 
to twelve cycles per second. During alpha periods 
subjects felt relaxed, they exerted no mental effort, 
and they experienced no visual imagery. 

Budzynski Stoyva, and Adler [1970] successfully 
applied the biofeedback method in treating tension 
headaches resulting from sustained contraction of 
scalp and neck muscles. Patients heard a tone with a 
frequency proportional to the electromyographic 
(EMG) activity in the monitored forehead muscles. 
They were instructed to keep the tone low by relax¬ 
ing their facial muscles. As the patients became more 
adept at muscular relaxation, the criterion was in¬ 
creased in graded steps requiring progressively more 
relaxation to achieve low-pitched tones. Through 
this method patients who had experienced daily 
headaches over a period of several years quit tensing 
their facial muscles and eventually eliminated their 
headaches. The clinical applicability of biofeedback 
techniques will depend on the degree of voluntary 
control people can exert on their physiological re¬ 
sponses. It would appear from preliminary findings 
that they possess greater self-regulatory capacities 
than was previously believed. 

Reinforcement control of behavior is further shown 
by evidence that people behave quite differently de* 
pending on the pattern and frequency with which 
their actions are reinforced. Those who have been 
rewarded each time they respond are likely to be¬ 
come easily discouraged and to give up quickly when 
their efforts fail. On the other hand, individuals 
whose behavior has been reinforced intermittently 
tend to persist for a considerable time despite set¬ 
backs and only occasional success. 

Intermittent reinforcement can take a variety of 
forms. Sometimes behavior is reinforced only after a 
specified period of time has elapsed ( fixed-interval 
schedule), as in pay periods, eating schedules, and 
recreational cycles. When rewards occur on a fixed 
temporal basis, the payoff is the same regardless of 
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tiie person s productivity; hence, behavioral, output 
is relatively low. For this reason people are more 
often rewarded on the basis of the amount of work 
they accomplish rather than on the passage of time. 
In a fixed-ratio schedule a person must complete a 
given amount of work for each reinforcement. Be¬ 
cause outcomes depend on ones own behavior, re¬ 
sponsiveness remains high. 

In everyday life most reinforcements occur not 
only periodically but in an irregular manner. For 
instance, superiors rarely check their employees’ 
work at the same time each day, and achievements 
regarded as adequate on one occasion may be con¬ 
sidered insufficient at another. Under variable- 
interval schedules individuals are reinforced at 
changeable times; in variable-ratio schedules the per¬ 
formances required for each reinforcement vary. 
Since in both instances the rewards occur unpredict- 
a y, variable schedules produce higher and more 
consistent performances than those in which out¬ 
comes occur with evident regularity. Even under 
variable schedules, outcomes occurring on a per- 
ormance rather than on a time basis produce higher 

icevi ° f res P oasiveness - Ferster [Ferster & Skinner 
1957] created a striking example of schedule influ¬ 
ences by rewarding simultaneously the right-hand 
responses of a subject on a fixed-ratio and the left- 
hand responses on a variable-ratio schedule. The sub¬ 
ject produced two remarkably different sets of per¬ 
formances, each corresponding to the typical response 
rates generated by these types of schedules. 

As a further way of illustrating schedule control of 
behavior let us consider how patrons of the gambling 
devices at Las Vegas might behave under different 
systems of reinforcement. Given a fixed-interval 
schedule, where playing a slot machine produces a 
|ackpot each time an hour has elapsed, patrons would 
deposit a coin and then sit around watching the 
clock until it was time to insert the next coin for the 
payoff. Under a fixed-ratio schedule requiring, for 
example, 25 plays to produce a jackpot, patrons 
would be seen busily stuffing slot machines at a high, 
stable rate. If the customary reinforcement was no 
longer forthcoming after an hour in the first case 
and after 25 responses in the second, players would 
rapidly quit with loud complaints. On a variable- 
interval schedule—where players receive a jackpot on 
the average every hour but it can occur unpredict- 
ably after 10 minutes, a half hour, or 2 hours-they 
are likely to wait for a while immediately after gain¬ 
ing a jackpot and then play at a moderate, steady 
rate. Put on a variable-ratio schedule-wherc on the 
average every 25th response produces a payoff but 
sometimes^ responses are sufficient, whereas at other 
times 10, 50, or even as many as 200 responses may 


be needed to get a jackpot-players would continue 
cranking the machine at a rapid pace without a 
break until their money ran out, fatigue set in, or 
their spouses intervened. Since the variable-ratio 
schedule is most powerful in maintaining behavior 
at minimum costs, owners of gambling casinos under¬ 
standably favor it. Behavior that has been reinforced 
on a thin variable-ratio schedule is exceedingly hard 
to extinguish because one’s efforts are sustained by 
the belief that the actions will eventually prove suc¬ 
cessful and it takes a long time to realize that the 
rewards are no longer forthcoming. 

In everyday life mixed schedules of reinforcement 
predominate, with both the number of unreinforced 
responses and the intervals between reinforcements 
continually changing. Drivers, for example, are not 
fined for every traffic violation. Speakers are not 
equally applauded every time they express their 
views. And scheduled activities do not always occur 
on time. Behavior is usually associated with vari¬ 
able outcomes because people are not always around 
to mediate the reinforcing consequences. Even when 
they are present, their preferences often differ so 
that the same actions may be applauded, ignored, or 
disapproved, depending on with whom one interacts. 

The matter of reinforcement control is much more 
complicated than would appear from the discussion 
us far. If people did not have the power of counter¬ 
control, they could be impelled to do almost any¬ 
thing simply by arranging the appropriate conse¬ 
quences. In social interactions, however, participants 
are dependent upon each other to get what they 
want and consequently they have some power over 
each other. When conflicts arise, as they often do, 
compromise systems are adopted that are acceptable 
to both parties. As power disparities change, so do 
reinforcement systems. When employers enjoyed 
commanding positions, they paid their employees on 
a piece-rate basis representing various fixed-ratio 
schedules. As organized labor gained increasing 
power, it was able to negotiate interval schedules in 
which prescribed wages were guaranteed on a daily, 
weekly, monthly, and eventually on an annual basis. 

In order to ensure high performance, many em¬ 
ployers combined ratio and interval schedules by 
paying fixed wages pius commissions based on indi¬ 
vidual or group productivity. Ir. many other areas of 
functioning people understandably strive for power 
that would enable them to remove bothersome per¬ 
formance requirements for desired outcomes. Further 
illustrations are provided later of how the potential 
power of external reinforcement is weakened and 
counteracted by reciprocal control and by the influ¬ 
ence of other reinforcement systems. 

The opinion is often expressed that desired activi- 
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ties should be performed for their own sake. It is 
feared that reinforcing practices may not only inter¬ 
fere with the development of self-determining char¬ 
acteristics but may render people so dependent upon 
extrinsic supports that they rem ain unr esponsive 
without payoffs. Some of the more intemperate 
critics, whose own activities are generously rein¬ 
forced by salaries, consulting fees, book royalties, 
and the applause of sympathetic audiences, consider 
the use of reinforcement to be manipulative and 
debasing. 

The fact that behavior is controlled by its conse¬ 
quences is not a phenomenon created by behavioral 
scientists, any more than physicists are responsible 
for the laws of gravity. The process of natural selec¬ 
tion has favored organisms adept enough to regulate 
their behavior on the basis of the effects it produces. 
Continuing with the above example, a social com¬ 
mentator might express moral indignation over gravi¬ 
tational control of behavior and denounce it as de¬ 
humanizing and degrading. A poetic view of man 
might be more flattering but it would in no way 
reduce the likelihood that people will continue to fall 
should they fling themselves off heights. The major 
purpose of psychological science is not to romanticize 
human behavior but rather to understand it. 

Some of the criticisms that have been levied against 
reinforcement practices fail to recognize the com¬ 
plexity and the developmental changes in the conse¬ 
quences that influence behavior. At the earliest de¬ 
velopmental levels, infants and young children are 
responsive only to immediate physical events, such 
as food, painful stimulation, and physical contact. It 
would be foolhardy for parents to rely on children's 
self-actualizing tendencies to keep them out of fires, 
electric outlets, or busy thoroughfares. In the course 
of development, as physically rewarding experiences 
are repeatedly associated with expressions of interest 
and approval, and punishments with disapproval and 
withdrawal of interest, the social reactions them¬ 
selves eventually acquire reinforcing properties. At 
this higher level of psychological functioning behav¬ 
ior is extensively governed by symbolic social rein¬ 
forcements. It would be a rare, unfeeling person who 
could remain totally indifferent to the sentiments of 
others. 

Some child-rearing authorities popularized the 
view that healthy personality development can be 
achieved only through "unconditioned love." If this 
principle were, in fact, applied, parents would re- 
J spond warmly and affectionately regardless of how 
their children behaved—whether or not they treated 
others maliciously, stole whatever they wanted, 
amassed a record of failing grades, showed contempt 


for the wishes and rights of others, and demanded 
immediate gratification of their desires. Uncondi¬ 
tioned love, were it possible, would make children 
directionless, irresponsible, asocial, and quite unlov¬ 
able. Most readers are probably acquainted with 
families where parents who attempted to approxi¬ 
mate this condition succeeded in producing "self- 
actualized” tyrants. Guideless interest is clearly not 
enough. Fortunately, the vast majority of parents are 
not indiscriminate dispensers of affection. Being hu¬ 
man, they tend to be displeased with reprehensible 
actions. It also comes as no surprise that some of the 
strongest advocates of unconditional regard are quite 
selective in their own social responsiveness, approv¬ 
ing things they like and disapproving those they do 
not [Murray 1956, Truax 1966]. And recipients of 
this differential treatment accommodate to their 
preferences. 

There are a number of symbolic reinforcers, other 
than social responses, that take on reinforcing func¬ 
tions. Money, having exchange value for countless 
things that people want, is a dependable and dur¬ 
able generalized reinforcer of behavior. When ex¬ 
emplary achievements are recurrently rewarded, 
qualitative differences in performance acquire rein¬ 
forcing value. After signs of progress and merited 
attainment become a source of personal satisfaction, 
knowledge that one has done well can function as a 
reward. However, the reinforcement ensuing from 
successful results is most likely mediated through 
self-reinforcement. For this reason, correctness feed¬ 
back on tasks that are personally devalued or very 
simple is unlikely to have much reinforcing value. 
On the other hand, attainments that exceed personal 
standards of what constitutes a worthy performance 
activate positive self-evaluations, whereas inadequate 
accomplishments produce self-dissatisfaction. 

Many of the activities that enhance competency 
are initially tiresome and uninteresting and it is not 
until one acquires proficiency in them that they as¬ 
sume reinforcing functions. During initial stages of 
skill development children are often coerced to per¬ 
form required activities by demands and threats that, 
more often than not, instill antipathies rather than 
competencies. This prevalent problem can be largely 
avoided by rewarding children's efforts, both socially 
and tangibly, until the behavior is developed to the 
stage at which it produces more natural reinforcing 
effects. Thus, for example, children may temporarily 
require extrinsic encouragement to teach them how 
to read, but after they become skilled at it they read 
on their own for the enjoyment it provides. Many 
other forms of behavior, such as verbal facility and 
manual skills, which enable people to deal more 
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effectively with their environment, do not require 
arbitrary rewards to sustain them. • 

, S ° me Performances are also partially maintained 
by the sensory reinforcement they produce. Infants 
for example, repeatedly perform responses for cer- 
tam sounds and sights and older children and adults 
spend long hours playing musical instruments that 
create pleasing sensory feedback. When activities 
are sustained by the sensory effects they naturally 
create, the phenomenon is usually designated as 
mtrmsic reinforcement. This process is often distin¬ 
guished from extrinsic reinforcement, where the out¬ 
comes, such as social approval or monetary rewards 
represent arbitrary response consequences. The rein- 
orcing value of most forms of sensoiy feedback is 
learned rather than inherently furnished. Indeed 
many of the things that people enjoy doing for then- 
own sake, whether playing atonal music or writing 
compositions, were probably experienced originally 
as somewhat aversive. As a result of repeated expo- 
ure, stall acquisition through initial extrinsic rein- 
forcement, and investment of self-esteem rewards in 
merited performances, however, the activities even- 
tuaily take on positive value. 

The highest level of autonomy is achieved when 
individuals regulate their own behavior by self- 
evaluative and other self-produced consequences. 
The paramount role played by self-reinforcement in 
controlling human actions is given detailed consider¬ 
ation in a later section of this paper. Behavior is 
east susceptible to the vagaries of externally occur¬ 
ring reinforcement when effective consequences are 
el her intrinsically related to the behavior or are 
self-administered. Paradoxically, the types of sym¬ 
bolic and self-regulatory mechanisms that human- 
lsbcally onented commentators consider to be an- 

b f ha V° raI a PP r °aches are, in fact, most 
successfully developed by practices derived from 
principles of social learning. 


Vicarious Reinforcement 

Human functioning would be exceedingly inefB- 

tD . " on dangerous, if behavior were 
controlled only by directly experienced consequences. 
Fortunately people can profit greatly from the ex- 
p ences of others. In everyday situations reinforce¬ 
ment typically occurs within a social context. That is 
people repeatedly observe the actions of others and 
the occasions on which they are rewarded, ignored 
or punished. Despite the fact that observed rewards’ 
and punishments play an influential role in regulat¬ 
ing behavior, vicarious reinforcement has, until re¬ 


cent years, been essentially ignored in traditional 
theories of learning. 

There is a second reason why the study of vicar¬ 
ious reinforcement is critical to the understanding of 
reinforcement influences. Observed consequences 
provide reference standards that determine whether 
a particular reinforcer that is externally administered 
will serve as a reward or as a punishment. Thus, for 
example, the same compliment is likely to be dis- 
couraging to persons who have seen similar per¬ 
formances by others more highly acclaimed, but re¬ 
warding when others have been less generously 
praised. J 

Besearch on the relational character of reinforcing 
events has shown that the same consequence can 
have rewarding or punishing effects on behavior de¬ 
pending upon the nature, frequency, or generosity 
with which ones performances were previously rein- 
forced^ However, incentive contrast effects, resulting 
from discrepancies between observed and directly 
experienced consequences, have received relatively 
little attention. 7 

Vicarious Punishment 


v icarious reinforcement is defined as a change in 
the behavior of observers resulting from seeing the 
response consequences of others. Vicarious punish¬ 
ment is indicated when observed negative conse¬ 
quences reduce people's tendency to behave in sim¬ 
ilar or related ways. This phenomenon has been 
studied most extensively with respect to aggressive 
behavior. In the typical experiment [Bandura 1965 
Bandura, Ross, & Ross 1963bJ children are shown a 
film depicting a model engaging in novel aggres¬ 
sive behaviors that are either rewarded, punished 
or unaccompanied by any evident consequences.' 

itnessing aggression punished usually produces 
less imitative aggression than seeing it obtain social 
and material success or go unnoticed. 

Because of the variety and complexity of social 
influences, people are not always consistent in how 
they respond to aggressive behavior. Rosekrans and 
Har+up [1967] examined the effects of discrepant 
observed consequences on imitative aggression. Chil¬ 
dren who saw assaultive behavior consistently re¬ 
warded were most aggressive, those who saw it con¬ 
sistently punished displayed virtually no imitative 
behavior while those who saw aggression sometimes 
rewarded and sometimes punished exhibited a mod¬ 
erate level of aggressiveness. 

A second major set of experiments has been con¬ 
cerned with how vicarious punishment affects peo- 
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pies willingness to violate prohibitions. Walters and 
bis associates [Walters, Leat, & Mezei 1963, Walters 
& Parke 1964, Walters, Parke, & Cane 1965] have 
shown that witnessing peer models punished for vio¬ 
lating prohibitions increases observers’ inhibition of 
transgressive behavior as compared with conditions 
in which modeled transgressions are either rewarded 
or simply ignored. Results of a comparative study by 
Benton [1967] indicate that, under some conditions, 
observed and directly experienced punishment may 
be equally effective in reducing deviant behavior. 
Children who observed peers punished for engaging 
in prohibited activities later showed the same amount 
of response inhibition as the punished transgressors. 

An interesting experiment by Crooks [1967] re¬ 
veals that lower species are also highly susceptible 
to observed punishments. After being tested for the 
extent to which they handled play objects, monkeys 
observed distress vocalizations sounded (through a 
tape recorder) whenever a model monkey touched 
a particular object; they also witnessed the model's 
contacts with a control object accompanied by the 
distress vocalizations played backwards, which did 
not sound like a pain reaction. In a subsequent test 
the observing animals played freely with the control 
item but actively avoided objects that supposedly 
produced painful experiences for another animal. 

In all of the preceding studies the model was 
punished either verbally or physically by someone 
else. In many instances persons respond with self- 
punitive and self-devaluative reactions to their own 
behavior that may be considered permissible, or 
even commendable, by others. Numerous experi¬ 
ments, which are discussed later, demonstrate that 
witnessing punishments self-administered by a model 
has inhibitory effects on observers with respect to 
unmerited achievements. Observation of self-punish¬ 
ment by a model has been shown by Porro [1968] to 
exert similar effects on transgressive behavior. For 
children who viewed a filmed model exhibit self- 
approving responses to her transgressions, 80 per 
cent subsequently handled toys they were forbidden 
to touch, whereas the transgression rate was only 20 
per cent for children who had observed the same 
model respond self-critically toward her own trans¬ 
gressions. 


Vicarious Positive Reinforcement 

Behavior can be enhanced as well as reduced by 
observed outcomes. Vicarious positive reinforcement 
is evident when observers display an increase in be¬ 
havior for which they see others rewarded. Results 


of numerous experiments generally show that ob¬ 
served rewards produce a greater increase in similar 
responding than if the exemplified actions have no 
evident consequences. In the case of behavior that 
is ordinarily disapproved, however, seeing transgres¬ 
sions go unpunished seems to heighten analogous 
actions in observers to the same degree as witnessing 
models rewarded [Bandura 1965, Walters & Parke 
1964, Walters, Parke, & Cane 1965]. To the extent 
that absence of anticipated punishment conveys per¬ 
missiveness and allays fears, behavioral restraints 
are thereby reduced and transgressive actions are 
performed more readily. 


Relative Effectiveness of Direct and 
Vicarious Reinforcement 


How do observed consequences compare with di¬ 
rectly experienced ones in their power to influence 
behavior? The answer to this question partly depends 
upon whether one assesses effects in terms of what 
is learned or what is performed. By watching the 
types of consequences produced by different actions 
in various settings, observers can learn the responses 
considered appropriate in given situations. Observers 
generally learn faster than reinforced performers, 
especially on tasks requiring conceptual behavior 
(Berger 1961, Hillix & Marx 1960, Rosenbaum & 
Hewitt 1966]. It is not difficult to find reasons for the 
relative superiority of vicarious reinforcement. Per¬ 
formers may be impeded in discerning which of their 
actions lead to success by the need to create, to 
select, and to enact responses and by the emotional 
arousal resulting from experienced rewards and pun¬ 
ishments. Observers, on the other hand, can give 
their undivided attention to discovering the correct 
solutions. 


ui vicarious ana airect rein¬ 
forcements is reversed with respect to their motiva¬ 
tional effects, as reflected in the capacity to maintain 
effortful behavior over a long period. One would not 
recommend to employers, for example, that they 
maintain the productivity of their employees by hav- 
ing them witness a small group of workers receive 
pay checks at the end of each month. Seeing others 
rewarded may temporarily enhance responsiveness 
but it is unlikely by itself to have much sustaining 
power. Observation of other people's outcomes, how¬ 
ever, can have a continuing influence on the effec¬ 
tiveness of direct reinforcement by providing a stand¬ 
ard for judging whether the reinforcements one cus¬ 


tomarily receives are equitable, beneficeri^-Or unfair. 
Since both direct and vicarious reinforcements in- 
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evitably occur together in everyday life, the inter- 
,iotive__egeQts. of these two sources of influence - on" 
human behavior are of much greater significance 
than their independent controlling power. This as¬ 
sumption is borne out by evidence that seeing how 
others are reinforced can significantly increase or re¬ 
duce the effectiveness of direct rewards and punish¬ 
ments m changing observers’ responsiveness [Con- 
drell 1967, Ditrichs, Simon, & Greene 1967, Marlatt 
1968], 

Consistent with the preceding findings, explana¬ 
tions of social behavior emphasize relative rather 
than absolute reinforcement in determining the level 
. of productivity and discontent within a society. Dis- 
( advanta ged people may be rewarded more gener-/ 

I ously than in the past but still experience greater 
1 discouragement and resentment because the more', 

I affluent members of society make more rapid prog- \ 
‘ress, so that the disparity between the groups widens. 1 

Explanation of Vicarious Reinforcement 

Vicarious reinforcement is simply a descriptive 
term that does not explain how observed conse¬ 
quences produce their psychological effects. Social 
earning theory posits several different mechanisms 
by which witnessed rewards and punishments alter 
the actions, feelings, and thoughts of others [Bandura 
1971c], A vicanous reinforcement event may vary in a 
number of aspects, including the characteristics of 
the recipient and the reinforcing agents, the type 
and intensity of consequences, their justifiability, the 
situation in which reinforcements are administered, 
and the reactions of the participants. The number 
and type of mechanisms that are operative in any 
given instance will therefore depend upon the par- 
ticular combination of these component factors. 

One explanation of vicarious reinforcement is in 
terms of the informative function of observed out¬ 
comes. Response consequences accruing to others 
convey information to observers about the types of 
actions that are likely to be approved or disapproved 
Given knowledge about probable response conse¬ 
quences, people will generally do the things they 
have seen well received and avoid those that they 
have seen punished. 

The same behavior can produce markedly different 
consequences depending on the setting in which it is 
performed. What is permissible in a nightclub may 
be censurable in a church. Hence, adaptive function¬ 
ing requires not only response information but 
knowledge about what actions are appropriate in 
what setting. In modeling experiments where per¬ 
formers are rewarded for responding in a given con¬ 


text but ignored or punished for exhibiting the same 
behavior in a different situation, observers learn to 
discriminate critical features in their environment. 
[McDavid 1964, Wilson 1958], In this way, vicarious 
reinforcement increases responding to stimuli corre¬ 
lated with observed rewards and decreases respond¬ 
ing to stimuli signifying negative response conse- 
quences. 

Observed reinforcement is not only informative but 
can also have incentive motivational effects. Seeing 
others reinforced can function as a motivator by 
arousing in observers expectations that they will be 
similarly rewarded for analogous performances. An 
experiment reported by Burning [1965] illustrates 
how variations in the size of observed rewards, while 
providing equivalent information about the types of 
responses required for reinforcement, produce difier- 
ent levels of responsiveness through their motiva¬ 
tional effects on observers. Children who had ob¬ 
served a performer generously rewarded subse¬ 
quently responded more rapidly when they received 
smaller rewards for the same actions, whereas when 
observed rewards were smaller than the ones observ¬ 
ers later received, they worked more slowly. These 
unexpected findings were attributed by Bnining to 
the energizing effects of frustration in the under¬ 
compensated condition, and to satiation in the bene¬ 
ficent treatment. The frequency with which others 
were rewarded has also been shown to affect how 
long observers will persist before they give up when 
their efforts are never reinforced [Berger & Johansson 
1968, Rosenbaum & Bruning 1966]. Variations in the 
generosity with which other people are reinforced 
thus determines the speed, the vigor, and the per¬ 
sistence with which others behave. 

Models generally exhibit emotional reactions while 
undergoing rewarding or punishing experiences. 
Observers are easily aroused by the emotional ex- 
pressions of others. It was previously shown how 
vicariously elicited emotions can become conditioned 
either to the modeled behavior or to environmental 
stimuli that are regularly associated with performers' 
distress reactions. As a result of vicarious emotional 
conditioning, the negatively vaJenced stimuli or the 
matching responses are likely to frighten and to in¬ 
hibit observers. Emotional arousal and behavioral 
inhibitions can also be extinguished by having fearful 
observers watch performers engaging in threatening 
activities without experiencing any adverse conse¬ 
quences. Vicarious conditioning and extinction of 
emotional arousal may therefore partially account for 
increases and decreases in responsiveness that result 
from observing affective consequences accruing to 
models. 


26 



Social Learning Theory 


In everyday situations people not only see the con¬ 
sequences of another’s behavior but how he responds 
to his treatment. There is some evidence, presented 
by Ditrichs, Simon, and Greene [1967], that the 
performers responsiveness to reinforcement signifi¬ 
cantly affects how observers later react when they 
themselves are rewarded for displaying similar be¬ 
havior. Children who observed models give progres¬ 
sively more hostile responses for social approval later 
increased their own output of hostile responses under 
positive reinforcement, whereas when models gave 
progressively fewer rewarded hostile responses or 
reacted in random fashion, observers did not increase 
their expression of hostility even though they were 
positively reinforced whenever they did so. Suscep¬ 
tibility to direct reinforcement influences is thus 
increased by observed positive responsiveness and 
reduced by observed resistance. 

In addition to the aforementioned effects of vicar¬ 
ious reinforcement, social status can be conferred on 
performers by the manner in which their behavior 
is reinforced. Punishment tends to devalue the model 
and his behavior, whereas the same model assumes 
emulative qualities when his actions are praised and 
otherwise rewarded [Bandura, Ross, & Ross 1963b, 
Hastorf 1965]. Modification of model status, in turn, 
influences the degree to which observers pattern their 
own actions after behavior exemplified by different 
models. There are conditions, of course, where ob¬ 
served punitive treatment enhances rather than low¬ 
ers the recipient's social status. People who risk pun¬ 
ishment for upholding their basic rights and beliefs 
gain the admiration not only of their peers but of 
others as well, especially when the protest is directed 
at social practices that violate the professed values of 
society. It is for this reason that authoritative agen¬ 
cies are usually careful not to discipline deviators 
in ways that might martyr them. 

Observed reinforcements can alter the valuation 
of reinforcing agents a5 well as recipients. When 
societal agents misuse their power to reward and 
punish, they undermine the legitimacy of their au¬ 
thority and generate strong resentment. Under these 
conditions seeing inequitable punishment, rather 
than prompting compliance, may free incensed ob¬ 
servers from self-censure of their own actions and 
thus increase transgressive behavior. This is most 
likely to occur when retaliative counterreactions are 
self-justified as rectifying past grievances or prevent¬ 
ing further maltreatment. Otherwise considerate peo¬ 
ple can thus be readily provoked by observed injus- 
) tice to behave cruelly without remorse. 

None of the foregoing explanations assumes that 
vicarious reinforcement produces its effects by 


strengthening connections between stimuli and re¬ 
sponses. Such a mechanism of operation would require 
observers, not only to perform covert matching re¬ 
sponses concurrently with the model, but to experi¬ 
ence indirectly the reinforcements. A reinforcement 
process of this type is not implausible, though it 
seems highly improbable. 

Although the preceding discussion is concerned 
with possible mechanisms through which vicarious 
reinforcement affects observers, the alternative ex¬ 
planations apply equally to interpretation of how 
direct reinforcement influences performers. Rein¬ 
forcements convey information to performers about 
the types of responses that are appropriate- selective 
reinforcement directs performers’ attention to corre¬ 
lated environmental stimuli that signify probable re¬ 
sponse consequences; previous reinforcements create 
expectations that motivate actions designed to secure 
desired rewards and to avoid injurious outcomes; 
punishing experiences can endow persons, places, 
and things with fear-arousing properties that inhibit 
responsiveness; a given history of positive or nega¬ 
tive reinforcement can alter people’s self-evaluations 
in ways that affect their willingness to exhibit behav¬ 
iors that are discrepant with their self-attitudes and 
the determination with which they perform them; 
and finally, the treatment one receives alters liking 
and respect for the reinforcing agent. 

Self-Reinforcement 

The discussion thus far has illustrated how people 
regulate their behavior on the basis of response con¬ 
sequences that they either observe or experience 
firsthand. If actions were determined solely by exter¬ 
nal rewards and punishments, people would behave 
like weathervanes, constantly shifting in radically 
different directions to conform to the whims of 
others. They would act like segregationists with a 
racial bigot, like John Birchers with a zealous Bircher, 
like Communists with a devoted Communist, like 
Republicans with a staunch conservative, and like 
scoundrels with a villainous character. Close scru¬ 
tiny of social interactions would most likely reveal, 
barring powerful coercive pressures, steadfast adher¬ 
ence to ideological positions rather than compliant 
behavior reversals. Anyone who attempted to change 
a Bircher into a Communist, or a Catholic into an 
atheist, would quickly come to appreciate the exist¬ 
ence of potent internal sources of behavior control. 

The notion that behavior is controlled by its con¬ 
sequences is unfortunately interpreted by most peo¬ 
ple to mean that actions are at the mercy of situa¬ 
tional influences. In fact, behavior can, and is, exten- 
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sively self-regulated by self-produced consequences 
for one s own actions. Ir writing a term paper or pre¬ 
paring a manuscript for publication, for example, 
authors do not require someone sitting at their sides 
differentially reinforcing each written statement until 
a satisfactory version is produced. Rather, authors 
possess a standard of what constitutes an acceptable 
work and they engage in repeated self-corrective 
editing of their own writing performances until they 
are satisfied with what they have written. The self¬ 
editing often exceeds external requirements of what 
would be satisfactory to others. Indeed, some people 
are such severe self-editors that they essentially para¬ 
lyze their own writing efforts. In most other areas of 
functioning people similarly set themselves certain 
performance standards arid respond to their own 
behavior in self-satisfied or self-critical ways in ac¬ 
cordance with their self-imposed demands. Because 
of their greater representational and self-reactive ca¬ 
pacities, humans are less dependent upon immediate 
external supports for their behavior. The inclusion of 
self-reinforcement phenomena in learning theoiy thus 
greatly increases the explanatory power of reinforce¬ 
ment principles as applied to human functioning. 

After a self-monitoring reinforcement system has 
been developed, a given action typically produces 
sets of consequences-a self-evaluative reaction 
and an external outcome. These two sources of rein¬ 
forcement can occur in several different patterns. 
Sometimes people are rewarded socially or mate¬ 
rially for behavior that they devalue. Anticipation of 
self-reproach for personally repudiated actions pro¬ 
vides an important motivating influence to keep be¬ 
havior in line with adopted standards in the face of 
opposing influences. I here is no more devastating; 
punishment than self-contempt. Under conditions 
where self-devaluative consequences outweigh the 
force of rewards for accommodating behavior, exter¬ 
nal influences prove relatively ineffective. On the 
other hand, when external inducements, whether re¬ 
warding or coercive, prevail over self-reinforctn* 
mfluences, individuals exhibit cheerless compliance. 
Humans, of course, are equipped with facile coPni- 
hve capacities for reconciling distressing discrepan- 
cies Disliked actions can be, and often are, justified 
so that losses in self-respect are minimized as long as 
the self-deception remains convincing. 

An opposite type of conflict between externa] and 
self-produced consequences arises when people are 
pumshed for engaging in activities they value highly. 
Here, the relative strength of self-approval and exter¬ 
nal censure determine whether the behavior will be 
discarded or maintained. Another common situation 


is one where external reinforcement for certain activ¬ 
ities is minimal or lacking and individuals sustain 
their efforts largely through self-encouragement. Ex¬ 
ternal reinforcement exerts greatest influence when 
it is harmonious with, rather than contravenes, self- 
produced consequences. People strive actively to 
achieve and to maintain such conditions. They do 
this by selectively associating with persons who share 
similar behavioral standards, thus ensuring social 
support-for their own system of self-evaluation. 

There are three major aspects of self-reinforcement 
that warrant detailed comment. These issues are con¬ 
cerned with questions of how self-monitoring rein¬ 
forcement systems are established; how effective they 
are in regulating behavior; and what maintains them. 

Establishment of Self-Reinforcing Functions 

Self-reinforcement functions can be acquired in 
several different ways. One of these is through the 
process of selective reinforcement. People learn to 
evaluate their behavior partly on the basis of how 
others have reacted to it. Parents and other socializa¬ 
tion agents subscribe to certain norms of what con¬ 
stitute worthy performances. They are generally de¬ 
lighted and respond approvingly when children 
achieve or exceed desired standards and displeased 
when their performances fall short of the valued 
level. As a result of such differential treatment chil¬ 
dren eventually come to respond to their own behav¬ 
ior in self-approving and self-critical ways, depend¬ 
ing on how it departs from evaluative standards set 
by others. Some indirect support for the effects of 
direct training on self-reinforcement practices is pro¬ 
vided by Kanfer and Marston [1963]. They found 
that adults who had received indulgent treatment 
subsequently rewarded their performances more gen¬ 
erously than did those who had been stringently 
trained, even though the actual achievements of both 
groups were comparable. 

People not only prescribe self-evaluative standards 
for others, they also exemplify them in response to 
their own behavior. It is amply documented in psy¬ 
chological research that modeling is another influ¬ 
ential means of transmitting systems of self-reinforce¬ 
ment. In the procedure typically used to study this 
process, children or adults observe a model perform¬ 
ing a task in which he adopts either a high or a low 
performance standard for self-reinforcement. On 
trials in which the model achieves or exceeds his 
self-imposed demand, he rewards himself tangibly 
and voices self-praise; but when his attainments fall 
short of self-prescribed requirements, he denies him- 
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self freely available rewards and reacts in a self- 
derogatory manner. Observers later perform the task 
alone, receiving a predetermined set of scores. The 
performances for which they reward and punish 
themselves are recorded. 

Results of such experiments show that people tend 
to adopt standards of self-reinforcement displayed 
by exemplary models, they evaluate their own per¬ 
formances relative to that standard, and then' they 
serve as their own reinforcing agents. In a study by 
Bandura and Kupers [1964] children who observed 
a model set a high standard of self-reinforcement 
later rewarded themselves sparingly and only when 
they achieved superior performances, whereas chil¬ 
dren exposed to models who considered low achieve¬ 
ments deserving of self-reward tended to reinforce 
themselves for mediocre performances. A control 
group of children, who had no exposure to models, 
did not reward themselves selectively for differential 
levels of achievement. Subjects in the experimental 
conditions not only adopted the modeled standards 
of self-reinforcement but matched variations in the 
generosity with which the models rewarded their 
own performances. 

Social groups contain members of widely differing 
abilities, so that a given individual must select the 
modeled standards against which to evaluate his 
own accomplishments. His level of self-satisfaction 
and of self-disappointment will be determined to a 
large extent by the models with whom he compares 
himself. A study by Bandura and Whalen [1966] 
found that children readily adopted self-reinforce¬ 
ment patterns displayed by either low-achieving 
models who were satisfied with mediocre perform¬ 
ances or moderately competent models subscribing 
to self-reward standards within their reach. Accord¬ 
ing to social comparison theory, people tend to choose 
reference models similar in ability and to disregard 
those who are too divergent from themselves. Con¬ 
sistent with this view, children rejected lofty stand¬ 
ards of superior models and adopted lower self-reward 
requirements within the range of their achievements. 

Although there is an understandable reluctance to 
emulate exacting norms of distinguished models, it 
is nevertheless not uncommon for people to adopt 
stringent standards of self-reinforcement. Indeed, 
universities are heavily populated with students who 
are self-satisfied only with superior performances in 
whatever academic work they undertake. We con¬ 
ducted an experiment in our laboratory to explore 
some of the social conditions that might lead people 
to emulate austere standards of self-reinforcement 
even though adherence to such demands results in 


frequent self-dissatisfaction [Bandura, Grusec, 6 
Menlove 1967]. 

Children observed a highly capable adult who re 
warded himself only when he obtained superior 
scores that children rarely achieved when they late) 
performed the same task. Prior to this experience 
the adult model treated half the children in a warm 
rewarding manner and the others in a neutral, busi 
nesslike fashion. The quality of the social relation 
ship was varied on the assumption that the reward 
ingness of a model, which tends to increase interper 
sonal attraction, would facilitate emulation of the 
model’s exacting norms. 

Adherence to high performance standards is gen 
erally publicly acclaimed. Without societal valuatior 
most people would forego lofty aspirations if onl) 
because their attainment requires arduous work anc 
much self-denial of readily available gratifications 
To measure the effects of observed social rewards 
with half the children in the experiment the aduf 
model was praised for adhering to stringent stand 
ards of self-reinforcement, but with the remaining 
children the model received no social recognition fo: 
his high-standard-setting behavior. 

Ordinarily, individuals are exposed to a variety o: 
modeling influences, many of which operate in op 
posing directions. Speculations about the influenc( 
of multiple modeling on social learning general!) 
assign importance to conflicting identification witl 
adult and peer models. To study how children re 
solve the problem of simultaneous exposure to antag 
onistic modeling influences, half the children in eacl 
subgroup observed both the stringent adult and i 
peer model who displayed a low standard of self 
reward. When faced with a conflict between adul 
and peer standards, children would be predisposec 
toward peer modeling. Not only would a peer b< 
viewed as a more appropriate comparison model bu 
emulation of high aspirations results in frequent self 
criticism of one’s performances, which discourage: 
adoption of such standards. It was assumed, how 
ever, that the tendency for peer modeling to reduce 
the impact of adult modeling might be counteractec 
by opposing influences arising from positive ties tc 
the adult model and from social recognition of high 
standard-setting behavior. 

Children exposed to conflicting modeling influ 
ences were more inclined to reward themselves fo: 
low achievements than children who had observet 
only the adult model consistently adhering to a higl 
standard of self-reinforcement. Children were alsc 
more likely to impose severe criteria of self-rewarc 
on themselves when the adult model received socia 
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recognition for his high-standard-setting behavior 
than when the model’s stringent achievement de¬ 
mands went unrecognized. However, contrary to 
expectation, children who had experienced a highly 
nurturant interaction with the adult model were 
more likely to accept the low performance standard 
set by the peer than if the adult was less beneficent. 
A nurturant relationship was apparently interpreted 
by the children as permissiveness for lenient self¬ 
demands. 

Comparison of subgroups receiving various com¬ 
binations of treatments revealed that the influence 
o the peers liberal self-reward was effectively ne- 
gated by praising the adult's high-standard-setting 
behavior. The most austere pattern of self-reinforce¬ 
ment was displayed by children who experienced a 
relatively nonnurturant relationship with the adult 
model, who had no exposure to conflicting peer 
norms, and who witnessed the adult receive social 
recognition for holding to high standards. These 
children, who rarely considered performances that 
fell below the adult's criterion worthy of self-reward, 
displayed unyielding self-denial. The adoption and 
continued adherence to unrealistically high self- 
evaluative standards is especially striking considering 
that the self-imposition of rigorous performance de¬ 
mands occurred under conditions where the children 
were at liberty to reward themselves whenever they 
wished without anyone around to judge their actions. 
Moreover, since children rarely attained the modeled 
standard, adopting it as an index of personal merit 
resulted in repeated self-devaluation and self-forbid- 
dance of freely available rewards. 

In everyday life it is not uncommon for people to 
differ in what they practice and in what they preach. 
Some parents, for example, lead a frugal self-denying 
life but are lenient in what they demand of their 
children; others are self-indulgent while expecting 
their children to subscribe to high standards of 
achievement entailing long hours of work and sacri¬ 
fice of many day-to-day pleasures. A number of re¬ 
searchers have investigated the effects of such dis¬ 
crepancies on the development of self-reinforcement 
patterns [McMains & Liebert 1968, Mischel & Lie- 
bert 1966, Rosenhan, Frederick, & Burrowes 1968], 

, findings generally show that children reward 
themselves most sparingly when stringent standards 
have been consistently modeled and imposed where¬ 
as social learning conditions in which adults both 
model and impose lenient performance demands pro¬ 
duce children who reward themselves generously 
or mediocre attainments. Discrepant practices, on 
the other hand, in which models prescribe stringent 


standards for others but impose lenient ones upon 
themselves, or impose austere demands on them¬ 
selves and lenient ones on others, reduce the likeli¬ 
hood that high standards of self-reward will be 
adopted. 

The manner in which self-reward patterns may be 
passed on through a succession of models has been 
demonstrated by Mischel and Liebert [1966], Chil¬ 
dren who had adopted high standards of self-rein¬ 
forcement displayed by adults later modeled and ap¬ 
plied the same standards in relation to peers. Marston 
[1965a] has likewise shown in an experiment with 
adults that seeing models reinforce their perform¬ 
ances either generously or sparingly not only affected 
how liberally observers rewarded their own behavior 
but influenced the frequency with which they later 
reinforced another person performing the same task. 

The laboratory findings corroborate field studies 
demonstrating that in cultures where austerity is 
consistently modeled and taught as the dominant 
social norm people not only reward themselves spar¬ 
ingly, but because of the emphasis on personal re¬ 
sponsibility for high standards of conduct, self-deny- 
ing, self-punitive, and depressive reactions are pre¬ 
valent [Eaton & Weil 1955], By contrast, in societies 
where liberal self-gratification patterns predominate, 
people usually reward themselves generously for 
minimal performances [Hughes, Tremblay, Rapoport 
& Leighton I960], v 

Self-Reinforcement and Self-Concept 

Accounts of personality theories frequently draw 
sharp distinctions between phenomenological ap¬ 
proaches in which the self-concept is a central fea¬ 
ture and behavioral orientations that supposedly dis¬ 
miss self-evaluative phenomena. Behavior theories 
differ among themselves, of course, in what they 
choose to study and in what they regard as the 
nature and locus of the causes of human behavior. 

It is evident from the preceding discussion that self- 
evaluative and self-reinforcing functions assume a 
prominent role in social learning theory. It will be 
recalled that individuals who had been exposed to 
models favoring lenient standards of self-reinforce¬ 
ment were highly self-rewarding and self-approving 
for comparatively mediocre performances; converse¬ 
ly, persons who observed models adhering to strin¬ 
gent performance demands displayed self-denial and 
self-dissatisfaction for objectively identical accom¬ 
plishments. These contrasting self-reactions to one’s 
own behavior illustrate how self-esteem, self-concept, 
and related self-evaluative processes can be con- 
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ceptualized within a social learning framework From 
this perspective, self-esteem is the result of discrep¬ 
ancies between a person’s behavior and the standards 
that he has selected as indices of personal merit. 
When behavior falls short of one's evaluative stand¬ 
ards, the person judges himself negatively or holds 
himself in low self-esteem. On the other hand, when 
performances coincide with, or exceed, a person's 
standards he evaluates himself favorably, which is 
considered indicative of high self-esteem. 

The self-concept also reflects the phenomenon of 
self-reinforcement. Self-concept usually signifies a 
person’s tendency to regard different aspects of his 
behavior positively or negatively. In measuring this 
personality characteristic individuals are presented 
with a set of evaluative statements in the form of 
adjective check lists, Q-sorts, or inventories and are 
asked to rate which statements apply to them. The 
individual responses are then summed to provide a 
global index of self-evaluation, which represents his 
self-concept. Within a social learning approach, a 
negative self-concept is defined in terms of frequent 
negative self-reinforcement of one’s behavior; con¬ 
versely, a favorable self-concept is reflected in a dis¬ 
position to engage in high positive self-reinforcement. 
Marston [1965b] has conceptualized the phenomena 
subsumed under the term self-concept in much the 
same way. 

Personality theories often attribute variations in 
behavior to differences in values. Behavioral ap¬ 
proaches treat values largely in terms of incentive 
preferences. As we have already observed, the kinds 
of responses a person makes are affected by the posi¬ 
tive and negative outcomes they are likely to pro¬ 
duce. Traditional value theory ; however, is more 
concerned with valuation of behavior than of ex¬ 
ternal outcomes. Established personality theories 
tend to regard values as global personal entities that 
influence behavior but they fail to explain exactly 
how values control specific actions. Within the social 
learning framework, self-reinforcement is the con¬ 
trol mechanism. The behavioral standards represent 
the values and the anticipatory self-satisfaction and 
self-criticism for actions that correspond to or deviate 
from the adopted standards serve as the controlling 
influences. 

Dysfunctions in self-reinforcement systems often as¬ 
sume major importance in psychopathology through 
their capacity to create excessive self-punishment 
and aversive conditions that can maintain deleterious 
forms of deviant behavior. Many of the people who 
seek psychotherapy are behaviorally competent and 
free of debilitating anxiety, but they experience a 


great deal of personal distress stemming from exces¬ 
sively high standards of self-evaluation often sup¬ 
ported by unfavorable comparisons with models 
noted for their extraordinary achievements. As an 
unidentified pundit once remarked, “If you compare 
yourself with others, you may become vain or bitter; 
for always there will be greater and lesser persons 
than oneself ” Yet social comparisons are inevitable, 
especially in societies that place a high premium on 
competitiveness and individual achievement. Iron¬ 
ically, talented individuals who have high aspirations 
that are possible but difficult to realize are especially 
vulnerable to self-dissatisfaction despite their notable 
achievements. As Boyd [1969] graphically describes 
this phenomenon, “Each violinist in any second chair 
started out as a prodigy in velvet knickers who ex¬ 
pected one day to solo exquisitely amid flowers flung 
by dazzled devotees. The 45-year-old violinist with 
spectacles on his nose and a bald spot in the middle 
of his hair is the most disappointed man on earth.” 
Linus, the security-blanketed member of the Peanuts 
clan, also alluded to this phenomenon when he 
wisely observed, “There is no heavier burden than a 
great potential." 

In its more extreme forms, a harsh system of self¬ 
reinforcement gives rise to depressive reactions, 
chronic discouragement, feelings of worthlessness, 
and lack of purposefulness. Excessive self-disparage¬ 
ment, in fact, is one of the defining characteristics of 
psychotic depression. As Loeb, Beck, Diggory, and 
Tuthill [1967] have shown, depressed adults evalu¬ 
ate their performances as significantly poorer than 
do nondepressed subjects, even though their actual 
achievements are the same. People also suffei from 
considerable self-devaluation when they experience 
a loss in ability due to age or physical injury but 
continue to adhere to their original standards of 
achievement. In the latter instances, most of their 
performances are negatively self-reinforced to the 
point where they eventually become apathetic and 
abandon activities that previously brought them a 
great deal of personal satisfaction. 

When a person’s behavior produces self-punishing 
consequences, any activities that avert or reduce 
these disturbing effects are thereby strengthened and 
maintained. A variety of deviant behaviors can serve 
as means of escaping or avoiding self-generated dis¬ 
tress. Some people whose accomplishments bring 
them a sense of failure resort to alcoholic self¬ 
anesthetization; others escape into grandiose ideation 
where they achieve in fantasy what they failed in 
reality; others protect themselves against self-con¬ 
demnation for their lack of advancement by devel- 
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oping delusions of persecution; still others are trag¬ 
ically driven by relentless self-disparagement to 
suicide; and many renounce pursuits that have self- 
evaluative implications and gravitate to social groups 
that embrace an antiachievement norm. 

The preceding discussion portrays the peisonal 
misery that can result from stringent self-reinforce¬ 
ment. Social problems also arise from deficient or 
deviant self-reinforcement systems. Individuals who 
have failed to develop well-defined standards neces¬ 
sary for adequate self-regulating reinforcement and 
those who make self-reward contingent upon skillful 
performance of antisocial behavior readily engage in 
transgressive activities unless deterred by externally 
imposed controls. Similarly, individuals who set low 
behavioral standards for themselves are inclined to 
upset others by their indifference to achievement 
requirements. 


Behavior-Regulating Function of 

Self-Produced Consequences 

Psychologists have examined not only how people 
learn to respond to their own actions in self-reward¬ 
ing and self-punishing ways but the degree to which 
they can control their own behavior by self-produced 
consequences. Bandura and Perloff [1967] compared ' 
the relative effectiveness of self-monitored and exter¬ 
nally applied systems of reinforcement in an expert- 
ment at proceeded as follows; children worked at 
a manual task in which they could achieve progres- 

S ff C i V f ‘.' gher SC ° reS hy P erformin g increasingly more 
effortful responses. Eight complete rotations of a 
wheel were required to advance 5 points, so that, for 
example, a total of 16 cranking responses was re¬ 
quired to achieve a score of 10, 24 responses to attain 
a score of 15, and a total of 32 cranking responses to 
Ule ma ”mum score of 20. Children in the self- 
leinforcement condition selected which of these 
achievement levels they would strive for and re¬ 
warded themselves whenever they attained their self- 
prescribed standard of performance. Children who 
performed under externally administered reinforce¬ 
ment were individually matched with members in 
the self-reward group so that the same achievement 
standard was externally set for them and the rewards 
were automatically delivered whenever they reached 
e predetermined level. To determine whether sub¬ 
jects behavioral productivity was due to response 
reinforcement or to gratitude for the rewards that 
were made available, children in an incentive control 
group performed the task after they had received the 
rewards without any strings attached. A fourth group 


worked without any rewards at all to estimate the 
number of responses children would perform solely 
through interest in the task itself. Because the capac¬ 
ity to maintain effortful behavior is one of the most 
important features of a reinforcement operation, the 
dependent measure was the number of cranking re¬ 
sponses the children performed until they no longer 
wished to continue the activity. 

Children whose behavior was positively reinforced 
either by .themselves or by others performed substan¬ 
tially more responses than children who received the 
rewards in advance or were never rewarded. Girls 
were equally productive under externally and self- 
administered reinforcement. Although boys also 
worked hard when they rewarded their own per- 
ormances, they were even more responsive under 
conditions where reinforcement was externally regu- 

Of special interest is the prevalence with which 
children in the self-monitored condition willingly 
imposed upon themselves highly unfavorable sched¬ 
ules of reinforcement. Not a single child chose the 
lowest score, which required the least effort, while 
approximately half of them selected the highest 
achievement level as the minimal performance merit¬ 
ing self-reward. Moreover, a third of the children 
later raised their initial standard to a higher level 
without a commensurate increase in amount of self- 
reward, thereby requiring of themselves more work 
tor the same recompense. 

Why did children work themselves so hard when 
no one required them to do so? Since the experiment 
was not designed to provide evidence on this point, 
we can only speculate. It can be reasonably assumed 
that most older children have adopted standards of 
achievement through modeling and the evaluative 
reactions of others. Moreover, they are likely to have 
een criticized on many occasions for being self- 
satisfied with performances judged to be unworthy. 
Hence, under conditions where persons are provided 
with opportunities to optimize their material out¬ 
comes by resorting to behavior that has low self- 
regard value, conflicting tendencies are aroused. On 
the one hand, they are tempted to maximize rewards 
at minimum effort costs to themselves; they can 
achieve this by simply lowering their performance 
standards. On the other hand, reward for low-quality 
performances evokes self-reproof, which, if suffi¬ 
ciently strong, may inhibit undeserving self-compen¬ 
sation. Apparently, children were willing to deny 
themselves rewards over which they had full control 
rather than risk self-disapproval for unmerited self¬ 
reward. Many of the children, in fact, set themselves 
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performance requirements that incurred high effort 
costs at minimum material recompense. These find¬ 
ings are at variance with what one might expect on 
the basis of reward-cost theories, unless such formu¬ 
lations include the self-esteem costs of rewarding 
devalued behavior. 

In recent years psychologists have been developing 
self-reinforcement procedures that would enable peo¬ 
ple to control their own behavior more effectively. 
These methods are applied most extensively to the 
modification of behaviors that are personally disturb¬ 
ing or that create chronic difficulties for others. Given 
clearly defined objectives and some practice on how 
valued rewards can be self-administered contingently, 
children are able to manage their aggressive and 
scholastic behavior as well as or better than their 
teachers through self-reinforcement practices [Good- 
let & Goodlet 1969, Glynn 1970, Lovitt & Curtis 
1969]. Self-administered aversive consequences have 
been used with some degree of success to reduce 
stuttering, obsessional ruminations, craving for ad¬ 
dictive drugs, and deviant sexual behavior [Bandura 
1969a]. 

Recent investigations of techniques of self-control 
also assign a principal role to self-managed reinforce¬ 
ment [Ferster, Numberger, & Levitt 1962, Harris 
1969, Stuart 1967]. In such treatment programs 
changes in highly refractory behavior are induced by 
having people regulate the stimuli that ordinarily 
control undesired and competing response patterns. 
However, unless positive consequences for self-con¬ 
trolling behavior are also arranged, the well-inten¬ 
tioned practices are usually short-lived. 

Self-controlling behavior is difficult to maintain 
because it tends to be associated, at least initially, 
with relatively unfavorable conditions of reinforce¬ 
ment. Prepotent activities such as heavy drinking by 
alcoholics and excessive eating by obese people are 
immediately rewarding, whereas their detrimental 
consequences are not experienced for some time. 
Conversely, self-control measures usually produce 
immediate unpleasant effects while the personal 
benefits are considerably delayed. Self-reinforcement 
practices are, therefore, employed to provide im¬ 
mediate support for self-controlling behavior until 
the benefits that eventually accrue take over the rein¬ 
forcing function. This is achieved by having individ¬ 
uals select a variety of activities that they find 
rewarding and make them contingent upon the per¬ 
formance of desired behavior. 

The preceding studies primarily involve self-admin¬ 
istration of tangible reinforcers. Of considerable in¬ 
terest is evidence that imagined consequences can 


serve a reinforcing function in regulating overt be¬ 
havior. Weiner [1965] reports an experiment in which 
inappropriate motor responses by adults were either 
punished by withdrawal of monetary points or by 
having the subjects imagine the same loss of mone¬ 
tary points, or their performances had no conse¬ 
quences. He found that imagined aversive con¬ 
sequences and the actual occurrence of the same 
negative outcomes both reduced responding com¬ 
pared to the condition involving no feedback. Covert 
self-punishment, however, produced somewhat weak¬ 
er reductive effects. 

Although experimental demonstrations are lacking, 
there is every reason to expect that people can in¬ 
crease desired behavior by covert self-reward. The 
operation of covert self-reinforcement complicates 
interpretation of behavior changes accompanying ex¬ 
ternal reinforcements. It may not be the lights, the 
scores, and the evaluative statements used as re¬ 
wards in psychological experiments that are reinforc¬ 
ing; rather, such extrinsic events serve as cues that 
elicit covert self-satisfaction or self-criticism in sub¬ 
jects. It would follow from this line of reasoning that 
correctness feedback on tasks that are personally de¬ 
valued or regarded as trifling is unlikely to activate 
self-reward and hence will not operate as a positive 
reinforcement. On the other hand, behavioral im¬ 
provements are possible in the absence of extrinsic 
rewards because people can easily supply them to 
themselves. Preliminary findings of studies of self¬ 
reinforcement processes suggest that learning ap¬ 
proaches hold considerable promise for increasing 
people’s capacity to regulate their own feelings, 
thoughts, and actions. 

Conditions Supporting 
Self-Reinforcement Systems 

Since self-reinforcement systems are developed 
through social influence, there is no reason to assume 
that, once established, they become autonomous reg¬ 
ulators completely impervious to subsequent social 
influences. An interesting, but inadequately explored, 
question is what supports self-reinforcing actions. 
No elaborate theory is needed to explain why people 
reward themselves. The challenging question requir¬ 
ing explanation is why people deny themselves avail¬ 
able rewards over which they have full control, why 
they adhere to exacting standards that require diffi¬ 
cult performances, and why they punish themselves. 
Several different interpretations have been proposed. 

Conditioned relief. In the view advanced by 
Aronfreed [1964], people punish themselves because 
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such responses have become endowed with anxiety 
relief value through prior conditioning experiences. 
This classical conditioning interpretation assumes 
that when parents discipline their children they often 
voice their criticism as they cease punishing them. If 
verbal criticism is repeatedly associated with the 
termination of punishment, criticism eventually be¬ 
comes a relief signal indicating that punishment will 
soon end, thus allaying anxiety. Thereafter, when 
transgressive behavior arouses anticipatory fear, peo¬ 
ple criticize themselves for its conditioned tranquil- 
izing effects. Self-critical responses persist, according 
to Aronfreed, because they are automatically rein¬ 
forced by anxiety reduction. 

To test this theory, Aronfreed conducted an exper¬ 
iment in which children performed an ambiguous 
taslq on periodic occasions a buzzer sounded indi¬ 
cating that they had erred, at which point they were 
reprimanded for behaving the "blue" way and de¬ 
prived of some candy. For one group the critical 
label “blue was uttered as the buzzer and punish¬ 
ment were terminated; for a second group the label 
coincided with the onset of the buzzer and punish¬ 
ment; while with control children the blue label was 
verbalized as the buzzer was turned off without any 
accompanying punishment. On subsequent trials, 
during which the buzzer signaled a transgression,’ 
children who experienced labeling at the termination 
of punishment were more inclined to verbalize the 
critical label than either the controls or children re¬ 
ceiving labeling at onset of punishment, who did not 
differ from each other. 

These findings are consistent with a conditioned 
reinforcement view, but other aspects of the data 
cast doubt on this interpretation. After transgressing, 
the children rarely uttered the critical label on their 
own and did so only after the punisher prompted it 
*rom them through a series of questions concerning 
their actions. Given anxiety arousal, one would ex¬ 
pect an anxiety reducer to be performed quickly and 
spontaneously. Why endure discomfort if one can 
promptly relieve it by a soothing self-critical word? 
The children’s initial reluctance but later differentia] 
use of the critical label can be more adequately ex¬ 
plained in terms of its assumed functional value 
rather than its conditioned tranquilizing properties. 
Children for whom the critical label brought on pun¬ 
ishment would have little reason to use it. On the 
other hand, those who had earlier observed that the 
critical verbalization terminated punishment would 
e inclined to try it as a defensive maneuver in the 
face of the punishers probing. Once having seen that 
uttering the critical word apparently eliminated at 
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least the punisher s verbal reprimand, children would 
tend to repeat it for its presumed instrumental value 
Moreover, when children did use the blue label on 
signaled errors, it remains unclear whether they were 
simply reporting that they performed the “blue" way 
or whether they were in fact voicing self-criticism. 
An adequate test of the conditioning view of self- 
criticism, therefore, requires a situation in which 
children are provided with opportunities to actually 
criticize their actions in the absence of the punisher, 
thereby removing any possible external gains for 
such behavior. It might be noted in passing that the 
conditioning theory would also require several com¬ 
plicated assumptions to explain how people adopt 
self-punishing responses by observing punishments 
self-administered by a model for devalued behavior 
without observers receiving any direct painful treat¬ 
ment. 

Reduction of self-generated distress . When a per¬ 
son performs inadequately or violates his own stand¬ 
ards* of conduct, he is likely to experience self- 
deprecatory and other types of distressing thoughts. 
During the course of socialization the sequence of 
transgression—internal distress—punishment—relief is 
repeatedly experienced. In this process performance 
of punishable behavior creates anticipatory fears that 
often persist in varying degrees until the person is 
reprimanded. Punishment not only terminates wor¬ 
ries over discovery of the transgression and possible 
social condemnation, but it also tends to restore the 
favor of otliers. Thus, punishment can provide relief 
from thought-produced anguish that is enduring and 
often more painful than the actual reprimand itself. 
This phenomenon is most vividly illustrated in ex¬ 
treme cases where people torment themselves for 
years over relatively minor transgressions and do not 
achieve equanimity until after making reparations of 
some type. Self-punishment may serve a similar dis¬ 
tress-relief function. Having criticized or punished 
themselves for undesirable actions, individuals are 
likely to discontinue further upsetting ruminations 
about their behavior. 

The way in which self-punishment can be main¬ 
tained by averting anticipated threats is strikingly 
demonstrated by Sandler and Quagliano [1964]. 
After monkeys learned to press a lever to avoid being 
shocked, a second contingency involving self-admin¬ 
istered punishment was introduced. A lever press 
prevented the occurrence of the original shock, but 
it also produced an electric shock of lesser magni¬ 
tude. As the experiment progressed, the self-admin¬ 
istered shock was gradually increased in intensity 
until it equaled the one being avoided. However, the 
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animals showed no reduction in self-punishment even 
though this^ behavior no longer served as a lesser 
of two evils.” Even more interesting, after the avoided 
shock was permanently discontinued but lever-press¬ 
ing responses (which had now become objectively 
functionless) still produced painful consequences, 
the animals continued to punish themselves need¬ 
lessly with shock intensities that they had previously 
worked hard to avoid. This experiment reveals how 
self-punishment can become autonomous of contem¬ 
poraneous conditions of reinforcement and be main¬ 
tained through its capacity to forestall imagined 
threats that no longer exist. 

In psychotic disorders, self-punishment is often 
powerfully maintained by delusional contingencies 
that have little relationship to reality. In a case to be 
cited later, a man who judged trivial acts as heinous 
sins could relieve his fright of hellish torment and 
feelings of self-contempt only by performing exceed¬ 
ingly self-punitive behaviors for long hours. 

The preceding analysis of self-punishment can be 
applied as well to self-disappointing performances 
as to moral conduct. Like transgressive behavior, 
inferior performances can be a source of disconcert- 
ing thoughts and social disapproval that individuals 
will strive to reduce by criticizing or punishing them¬ 
selves. 

External reinforcement. Although self-punishment 
can be reinforced by its capacity to end or at least to 
reduce thought-produced distress, self-reinforcing re¬ 
sponses are partly sustained by periodic external 
reinforcement. Adherence to high standards of self- 
rcinforcement is actively supported through a vast 
societal system of rewards involving praise, social 
recognition, and a variety of awards and honors, 
whereas few accolades are bestowed on people for 
rewarding themselves on the basis of mediocre per¬ 
formances. To the extent that people choose a refer¬ 
ence group whose members share similar behavioral 
norms for self-reinforcement, a given individual's 
self-evaluations are undoubtedly influenced by the 
actual or anticipated reactions of members whose 
judgments he values. When a person’s immediate 
reference group is small and select, his self-evalua¬ 
tions are not much influenced by the views of others, 
and he sometimes appears to be an “inner-directed" 
person [Riesman 1950], although, in fact, he is highly 
responsive to a few individuals whose good opinion 
he prizes. The man is rare who regards his behavior 
so highly that the reactions of his fellows have no 
effect on his self-evaluation. 

In everyday life high evaluative standards are not 
only favored, but negative sanctions are frequently 


applied to discourage inappropriate positive self- 
reinforcement. Rewarding oneself for inadequate or 
undeserving performances is more likely than not to 
evoke critical reactions from others. Similarly, lower¬ 
ing one’s performance standards is rarely considered 
praiseworthy. 

Self-punishment often serves as an effective means 
of lessening negative reactions from others. When 
misdeeds are almost certain to evoke disciplinary 
actions, self-punishment may be the lesser of two 
evils. Stone and Hokanson [1969] show how self- 
punitive behavior can be effectively maintained by 
its self-protective and emotion-reducing function. 
When adults could avoid painful shocks by admin¬ 
istering to themselves shocks of lesser intensity, self- 
punitive responses not only increased but were ac¬ 
companied by reduction in autonomic arousal. 

Finally, verbal self-punishment can be a good way 
of extracting commendations from others as well. By 
criticizing and belittling themselves, people can pre¬ 
dictably get others to enumerate their noteworthy 
accomplishments and abilities, and to issue reassur¬ 
ing predictions that continued effort will produce 
future triumphs. In summary, self-reinforcing behav¬ 
ior is intermittently reinforced by both subjectively 
created contingencies and various external sources. 

Cognitive Control 

If human behavior could be fully explained in 
terms of external stimulus conditions and response 
consequences, there would be no need to postulate 
any additional regulatory mechanisms: Actions are 
not always predictable from these external sources 
of influence, however, because cognitive factors part¬ 
ly determine what one observes, feels, and does at 
any given moment. In the present discussion, cogni¬ 
tive events refer to imager)', to representations of ac¬ 
tivities in verbal and other symbols, and to thought 
processes. There are several wavs in which cognitive 
functioning enters into the regulation of human be¬ 
havior. These are discussed next. 

Cognitive Representation of 
Reinforcement Contingencies 

It was previously shown that repeated paired stim¬ 
ulation generally fails to produce conditioned re¬ 
sponses as long as the connection between stimulus 
events goes unnoticed. Response consequences simi¬ 
larly have little or no effect on behavior when the 
relationship between one’s actions and outcomes is 
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not recognized. On the other hand, cognitive repre¬ 
sentation of conditions of reinforcement typically 
results in abrupt improvements in performance, in¬ 
dicative of insightful functioning. 

Another interesting way of analyzing the process 
of cognitive control is to pit the power of belief 
against experienced reinforcement in the regulation 
of behavior. Several researchers have systematically 
explored the degree to which cognitive influences 
attenuate, distort, or nullify the effects of response 
consequences. Kaufman, Baron, and Kopp [1966] 
conducted a study in which motor responses of 
adults were rewarded each minute on the average 
(variable-interval schedule). One group was cor¬ 
rectly informed about how often their performances 
would be rewarded, whereas other groups were mis¬ 
led into believing that their behavior would be rein¬ 
forced either every minute (fixed-interval schedule) 
or after they had performed 150 responses on the 
average (variable-ratio schedule). Beliefs about the 
prevailing conditions of reinforcement outweighed 
the influence of experience consequences. Although 
all subjects actually received the same pattern of 
reinforcement, those who thought they were being 
rewarded once every minute produced very low rates 
of response (mean = 6); those who thought they 
were reinforced on a variable-ratio schedule main¬ 
tained an exceedingly high output (mean = 259); 
while those who were correctly informed that their 
behavior would be rewarded on the average ever)' 
minute displayed an intermediate level of responsive¬ 
ness (mean = 65). 

Human behavior is regulated to a large extent by 
anticipated consequences of prospective actions. In¬ 
dividuals may accurately assess the customary effects 
o given activities but fail to act in accordance with 
existing conditions of reinforcement because of false 
hope that their actions may eventually produce favor¬ 
able outcomes. In one study some children persisted 
in imitating nonfunctional responses of a model that 
were never reinforced in the erroneous belief that 
their continued imitativeness might change his rein¬ 
forcement practices [Bandura & Barab 1971). People 
also lead themselves astray by inaccurate expecta¬ 
tions when they wrongly assume that certain changes 
in their behavior will alter future response conse- 
quences. 

The oft-repeated dictum that behavior is controlled 
by its immediate consequences holds up better under 
dose scrutiny for anticipated consequences than for 
those that actually impinge on the organism. In most 
instances customary outcomes are reasonably good 
predictors of behavior because the consequences that 


people anticipate for their actions are accurately de¬ 
rived from, and therefore correspond dosely to, pre¬ 
vailing conditions of reinforcement. Belief and actu¬ 
ality, however, do not always coincide, because an¬ 
ticipated consequences are also partly inferred from 
observed response consequences of others, from what 
one reads or is told, and from a variety of other cues 
that, on the basis of past experiences, are considered 
reliable forecasters of likely outcomes. When actions 
are guided by anticipated consequences derived from 
predictors that do not accurately reflect existing con¬ 
tingencies of reinforcement, behavior is weakly con¬ 
trolled by its actual consequences until cumulative 
experiences produce more realistic expectations. 

In some of the more severe behavior disorders, 
psychotic actions are so powerfully controlled by 
bizarre subjective contingencies that the behavior 
remains unaffected by its external consequences. This 
process is vividly illustrated in the passages quoted 
below [Bateson 1961], taken from a patients account 
of his psychotic experiences in an insane asylum dur¬ 
ing the early 1800’s. The narrator had received a 
scrupulously moralistic upbringing, according to 
which actions ordinarily viewed as fully acceptable 
were judged by him to be deviant, sinful, and likely 
to provoke the wrath of God; consequently, many 
innocuous acts elicited dreadful apprehensions, which 
in turn motivated and maintained exceedingly pain¬ 
ful atonement rituals designed to forestall the im¬ 
agined disastrous consequences. 

In the night I awoke under the most dreadful impres¬ 
sions; I heard a voice addressing me. and I was made 
to imagine that my disobedience to the faith, in taking 
the medicine overnight, had not only offended the 
Lord, but had rendered the work of my salvation ex¬ 
tremely difficult, by its effect upon my spirits and 
humours. I heard that I could only be saved now by 
being changed into a spiritual body. ... A spirit came 
upon me and prepared to guide me in my actions. I 
was lying on my back, and the spirit seemed to light 
on my pjHw by my right ear, and to command my 
body. I was placed in a fatiguing attitude, resting on 
my feet, my knees drawn up and on my head, and 
made to swing my body from side to side without 
ceasing. In the meantime, I heard voices without and 
within me, and sounds as of the clanking of iron, and 
the breathing of great forge bellows, and the force of 
names. ... I was told, however, that my salvation de¬ 
pended on my maintaining that position as well as I 
could until the morning; and oh! great was my joy 
when I perceived the first brightness of the dawn, 
which I could scarcely believe had arrived so early 
1pp. 28-29]. 7 

In the experiences described above, the anxiety 
instigators and the influential response consequences 
are both internally created. Acceptance of medicine, 
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an act later considered a rebellion against, and mis¬ 
trust of, the Almighty, aroused dreadful hallucina¬ 
tions of Hadean torture, which could be banished 
only by performance of arduous bizarre behavior. 
The nonoccurrence of subjectively feared, but ob¬ 
jectively nonexistent, threats undoubtedly serves as 
an important source of reinforcement, maintaining 
many other types of psychotic behavior. Given fic¬ 
tional contingencies with a powerful internal rein¬ 
forcing system, a person's behavior is likely to remain 
under very poor environmental control even in the 
face of severe external punishments and blatant dis- 
confirming experiences. 

When I opened the door, I found a stout man servant 
on the landing, who told me that he was placed there 
to forbid my going out, by the orders of Dr. P. and 
my friend; on my remonstrating, he followed me into 
my room and stood before the door. I insisted on go¬ 
ing out; he, on preventing me. I warned him of the 
danger he incurred in opposing the will of the Holy 
Spirit, I prayed him to let me pass, or otherwise an 
evil would befal him, for that I was a prophet of the 
Lord. He was not a whit shaken by my address, so, 
after again and again adjuring him, by the desire of 
the Spirit whose word I heard, I seized one of his 
arms, desiring it to wither; my words were idle, no 
effect followed, and I was ashamed and astonished. 

Then, thought I, I have been made a fool of! But I 
did not on that account mistrust the doctrines by 
which I had been exposed to this error. The doctrines, 
thought I, are true; but I am mocked at by the Almighty 
for my disobedience to them, and at the same time, I 
have the guilt and the grief, of bringing discredit upon 
the truth, by my obedience to a spirit of mockery, or, 
by my disobedience to the Holy Spirit; for there were 
not wanting voices to suggest 'to me, that the reason 
why the miracle had failed, was, that I had not waited 
for the Spirit to guide my action when the word was 
spoken, and that I had seized the man's arm with the 
wrong hand [p. 331. 

The voices informed me, that my conduct was owing 
to a spirit of mockery and blasphemy having possession 
of me . . . that I must, in the power of the Holy Spirit, 
redeem myself, and rid myself of the spirits of blas¬ 
phemy and mockery that had taken possession of me. 

The way in which I was tempted to do this was by 
throwing myself on the top of ray bead backwards, and 
so resting on the top of my head and on my feet alone, 
to turn from one side to the other until I had broken 
my neck. I suppose by this time 1 was already in a 
state of feverish delirium, but my good sense and pru¬ 
dence still refused to undertake this strange action. I 
was then accused for faithlessness and cowardice, of 
fearing man more than God. 

I attempted the command, the servant prevented me. 

1 lay down contented to have proved myself willing to 
obey in spite of his presence, but now I was accused 
of not daring to wrestle with him unto blows. I again 


attempted what I was enjoined. The man seized me, I 
tore myself from him, telling him it was necessary for 
my salvation; he left me and went down stairs. I then 
tried to perform what I had begun; but now I found, 
either that I could not so jerk myself round on my 
head, or that my fear of breaking my neck was really 
too strong for my faith. In that case I then certainly 
mocked, for my efforts were not sincere. 

Failing in my attempts, I was directed to expectorate 
violently, in order to get rid of my two formidable 
enemies; and then again I was told to drink water, and 
that the Almighty was satisfied; but that if I was not 
satisfied (neither could I be sincerely, for I knew I had 
not fulfilled his commands), I was to take up my posi¬ 
tion again; I did so; my attendant came up with an 
assistant and they forced me into a straight waistcoat. 
Even then I again tried to resume the position to which 
I was again challenged. They then tied my legs to the 
bed-posts, and so secured me (pp. 34-35], 

Grotesque homicidal actions provide other strik¬ 
ing illustrations of how behavior can come under 
bizarre symbolic control. A study of presidential 
assassins [Weisz & Taylor 1970], for instance, shows 
that, with one exception, the murderous assaults were 
partly under delusional control. The assassins were 
driven to acts of violence by divine inner voices 
commanding them to do so, by delusional beliefs 
that the victim was conspiring to harm them, and by 
grandiose convictions that it was their heroic respon¬ 
sibility to eliminate maleficent leaders in positions 
of power. The assassins, being unusually seclusive in 
their behavior, shielded the delusional beliefs from 
corrective social influences. 

Representational Guidance of Behavior 

Symbolic processes play a prominent role in the 
acquisition and retention of response patterns as well 
as in their expression. The memory trace of monetary 
influences is short-lived, but such experiences often 
have lasting behavioral effects. This condition is 
made possible by the fact that transitory external 
events are coded and stored in symbolic form for 
memory representation. Patterns of behavior that 
have been observed and other experiences long past 
can thus be reinstated by visualizing them or by 
representing them verbally. These internal models 
of the outside world can serve as guides to overt 
action on later occasions. It will be recalled from the 
earlier discussion of learning processes that internal 
representations of patterned behavior are constructed 
from observed examples and from informative feed¬ 
back to one's trial-and-error performances. 

Representational mediators are especially influen¬ 
tial in early phases of response acquisition. After ac- 
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tion patterns have become routiiiized through over- 
learning, they are usually performed smoothly and 
automatically without requiring intermediary imag- 
inal or verbal guidance. Skilled performance can in 
fact, be disrupted by visualizing or thinking about 
what one must do next while carrying out the activity. 

Thought Control of Action through 

Covert Problem Solving 

Man’s efforts to understand and to manage his 
environment would be exceedingly wearisome, and 
perilous as well, if optimal solutions to problems 
could be arrived at only by performing alternative 
actrons and suffering the consequences. Fortunately, 
most problem solving occurs in thought rather than 
in action. Man's higher mental capacities, for exam¬ 
ple, enable him to design sturdy dwellings and 
bridges without having to build them until he hits 
upon a structure that does not collapse. Alternative 
courses of action are generally tested in symbolic 
exploration and either discarded or retained on the 
basis of calculated consequences. The best symbolic 
solution is then executed .in action. 

Symbols that represent external events, operations, 
and relationships are the vehicle of thought. Most 
thinking occurs in terms of language symbols. By 
manipulating words that convey relevant informa¬ 
tion, one can gain an understanding into causal prec¬ 
edes, arrive at solutions, and deduce consequences. 
Thinking also occurs in numerical and other symbols. 
The functional value of thought depends upon close 
correspondence between the symbolic system and 
external events so that the latter can be substituted 
for the former. Thus, subtracting the number 2 from 
0 yields the same outcome as physically performing 
he operation of removing two objects from a group 
of ten. Symbols can be manipulated much more easily 
than their physical counterparts, which greatly in¬ 
creases the scope and power of symbolic problem 
solving. Since symbols are the instruments of thought, 
the level of symbolization development partly deter¬ 
mines reasoning capacities, 

The process by which people learn to attain solu¬ 
tions covertly has received comparatively little atten- 
taon, despite its focal role in human functioning. 
Being a private activity, of course, it is not readily 
accessible to empirical investigation. Such cognitive 
skills are usually developed by performing operations 
initially on actual objects and then gradually trans¬ 
lating the external processes to covert symbolic ones 
of increasing complexity and abstraction. In the 
teaching of arithmetic principles, for instance, chil¬ 
dren first learn the formal operations of addition and 
subtraction by combining and withdrawing real ob¬ 


jects. Pictorial representations are also used in early 
phases as aids to acquiring arithmetic skills. After 
children have learned to solve arithmetic problems 
through physical manipulation of things, the objects 
are symbolized by numbers. Correct solutions are 
now achieved by manipulation of numerical symbols 
on paper, where each step can be checked and cor¬ 
rected. The process at this stage is still partially 
overt but the solutions are symbolic. Eventually the 
symbolic solutions are achieved covertly by having 
children think out the problem wi thin their heads. 
Overt and covert operations do not differ in any fun¬ 
damental way, although covert problem solving gen¬ 
erally makes heavy demands on memory processes. 
For tliis reason various diagrammatical and mechani¬ 
cal aids are used when solutions to problems require 
complex chains of symbolic activities. 


Interaction of Controlling Influences 

The three major systems by which behavior is 
regulated do not operate independently. Most actions 
are simultaneously controlled by two or more of the 
component influences. Moreover, the various systems 
are closely interdependent in acquiring and retaining 
their power to determine behavior. In order to estab- 
ish and to maintain effective stimulus control, for 
example, the same actions must produce different 
consequences, depending on the cues that are pres¬ 
ent. If walking through intersections on red or green 
signals left one equally vulnerable to being knocked 
down by automobiles, pedestrians would quickly dis¬ 
regard traffic lights and rely on other informative 
cues to guide them safely through busy thorough¬ 
fares. Earlier we noted how the effectiveness of 
verbal and other social influences is negated by faulty 
reinforcement practices and reinstated by ensuring 
that predictable consequences will ensue for respond¬ 
ing in a particular wav to certain stimulus situations. 

The preceding comments illustrate how stimulus 
determinants of behavior can be markedly affected 
by how they are correlated with reinforcing conse¬ 
quences. Stimulus and cognitive influences, in turn, 
can alter the impact of prevailing conditions of re¬ 
inforcement. Conditioned aversive stimuli can ac¬ 
quire such powerful control over defensive behavior 
that people avoid renewed encounters with feared 
persons or things. In instances where the original 
threats no longer exist, their self-protective behavior 
is insulated from realistic reinforcement control. 

Even when the things one dislikes or fears are not 
completely avoided, stimuli having strong emotion- 
arousing potential provoke defensive behaviors that 
predictably create adverse reinforcement contingen- 
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cies where they may not ordinarily exist. We‘might 
draw again on the advice column of a newspaper 
counselor to illustrate this process. 

Dear Abby: 

I have trouble with blondes. Every time I go for a girl 
and she is a blonde she turns out to be a gold-digger. 
I notice on TV whenever they have a gold-digger she 
is blonde. The last blonde 1 went with asked me to 
buy a record every time I took her out. She kept me 
busted buying her records. Should I pass up all blondes 
from now on? 

Blonde Trouble 

Dear Blonde Trouble: 

Plenty of golden heads have golden hearts. 

Abby 

To the extent that the correspondents distrust of 
blondes leads him to behave in ways that provoke 
unfriendly counteractions from them, the negative 
valence of blonde hair is repeatedly strengthened, and 
it in turn prompts actions that produce reciprocal 
negative reinforcement. Both processes thus support 
each other. 

The way in which beliefs and conscious recog¬ 
nition of environmental contingencies can enhance, 
distort, or even negate the influence of reinforcing 
consequences has already been amply documented 
and needs no further illustration. Cognitive events, 
however, do not function as autonomous causes of 
behavior. Their nature, their valence, and their oc¬ 
currence are under stimulus and reinforcement con¬ 
trol. Analysis of cognitive control of behavior is 
therefore incomplete without specifying what con¬ 
trols the influential cognitions. 

Cognitively based conditioning, for example, can¬ 
not occur unless the thoughts that serve as sources of 
emotional responses have been endowed with arousal 
potcnial. The research of Miller [1951] and Grose 
[1952] demonstrates that thoughts can acquire 
emotion-provoking properties through generalization 
from reinforcing experiences associated with overt 
responses. In these studies it was found that thoughts 
corresponding to verbalizations that had been pun¬ 
ished generated physiological arousal, whereas 
thoughts representing nonpunished verbalizations 
elicited no emotional reactions. If the painful experi¬ 
ences are sufficiently intense, however, they can con¬ 
dition such potent aversive properties that the dis¬ 
turbing thoughts themselves are completely inhibited 
[Eriksen & Kuethe 1956, Marks & Gelder 1967]. In 
personality theories, thought inhibition is usually 
designated as repression. 

Thoughts remain partly under external stimulus- 
control. Thus, the thoughts elicited in a hospital are 


markedly different from those aroused in a disco¬ 
theque. A simple cue from a past experience can re¬ 
instate reveries of bygone events. And perturbing 
trains of thought can be turned off by directing one’s 
attention to absorbing matters that elicit superseding 
cognitive activities. This form of self-control, in 
which thought-produced arousal is diminished by 
engrossment in absorbing literary material, televised 
programs, vocational and avocational pursuits, and 
other engaging projects, is widely used to restore a 
sense of well-being. 

The rules and principles that people use to guide 
their actions do not arise in a mental vacuum. When 
rules defining appropriate behavior in given situ¬ 
ations are not explicitly designated, they are arrived 
at largely through information conveyed by observed 
or experienced response consequences. Provisional 
hypotheses that produce responses resulting in rein¬ 
forcement are retained, partially correct hypotheses 
are successively refined on the basis of differential 
response feedback until the right one is hit upon, and 
hypotheses that give rise to faulty performances are 
promptly discarded. While it is true that implicit 
rules govern behavior, the rules themselves are partly 
fashioned oul of reinforcement experiences. 

Considering tire intricate interdependence of stim¬ 
ulus, reinforcement, and cognitive control systems, 
the sharp distinctions frequently drawn between re¬ 
inforcement and cognitive theories are of question¬ 
able value. It has been customary in psychological 
- theorizing to construct entire explanatory schemes 
around a single form of behavioral control, to the 
relative neglect of other obviously influential de¬ 
terminants and processes. Some theories have tended 
to concentrate upon stimulus control effected princi¬ 
pally through association of experiences; others have 
primarily focused upon external reinforcement con¬ 
trol; and advocates favoring cognitive interpreta¬ 
tions confine their interest largely to symbolic 
mediators. These resolute allegiances to part proc¬ 
esses may encourage intensive investigation of sub¬ 
systems, but considered alone, they do not provide a 
complete understanding of human behavior. 

Social Learning as a Reciprocal Influence Process 

In the social learning view, psychological function¬ 
ing involves a continuous reciprocal interaction be¬ 
tween behavior and its controlling conditions. Early 
attempts to incorporate both individual and environ¬ 
mental determinants in personality theory simply de¬ 
picted behavior as caused by these two sets of influ¬ 
ences. In the widely cited equation B = f(P } E), ac¬ 
tions were presumably best understood by consider- 


39 



Albert Bandura 


ing the joint effects of personal attributes and envi¬ 
ronmental pressures. The major weakness of this 
type of formulation is that it treats response disposi¬ 
tions and the environment as independent entities. 
Contrary to this assumption, the environment is only 
a potentiality, not a fixed property that inevitably 
impinges upon individuals and to which their behav¬ 
ior eventually adapts. Behavior partly creates the 
environment and the resultant environment, in turn 
influences the behavior. In this two-way causal proc¬ 
ess the environment is just as influenceable as the 
behavior it controls. 

Illustrations of how behavioral disposition and the 
environment affect each other can be found even in 
simple experiments with animals. To study defensive 
reactions to threats, Sidman [1966] devised a situ¬ 
ation in which animals could postpone the occurrence 
of painful shocks for a given period by depressing a 
lever. Shocks, for example, may be scheduled to 
occur every minute, but each bar press forestalls it 
tor thirty seconds so that animals have an effective 
means for controlling the punitiveness of their en¬ 
vironment. Under these conditions animals that 
quickly leam the adaptive behavior create for them¬ 
selves an environment that is essentially free of 
punishment. Others who, for one reason or another, 
are slow in acquiring the requisite coping responses 
experience a highly unpleasant milieu. Though the 
potential environment is identical for all animals the 
actuai environment varies in accordance with their 
behavior. Is the animal controlling the environment 
or is the environment controlling the animal? What 
we have here is a two-way control system, and it is 

arbitrary which aspect of wg choos£ tQ record 

When changes in the animals’ behavior are selected 
for analysis, the environmental contingencies appear 
to be the controllers of behavior. If, instead, one 
examines the amount of punishment created by each 
subject it is the environment that is controlled and 
modified by the animals' actions, and it can there¬ 
fore vary considerably for different subjects and at 
different times for the same subject. In examining 
how behavior determines the environment, one 
might, for instance, administer alcohol to one group 
and water to another within the same programmed 
milieu and then compare the types of aversive en¬ 
vironments subjects create for themselves under 
intoxicated and under sober conditions. 

The rewarding aspect of an environment is also 
onfy a potentiality until actualized through appropri¬ 
ate actions. A researcher once studied the behavior 
of schizophrenic and normal children in a room con¬ 
taining an extraordinary variety of reinforcing de¬ 


vices including, among other things, a pinball ma¬ 
chine, a color wheel, a television set, a phonograph 
an electric train, a picture viewer, an electric organ’ 
and candy- and trinket-vending machines. To actuate 
these various playthings, children had simply to de¬ 
posit available coins, but only when a light was 
fumed on; coins deposited when the light was off 
extended the period that the device would remain 
inoperative. Normal children rapidly learned the 
requisite behavior and created an unsually beneficent 
environment for themselves. On the other hand, the 
schizophrenic children, who failed to master this 
simple skill, experienced the same potentially re¬ 
warding environment as a depriving and emotionally 
disturbing place. 

In the preceding examples, the features of the po¬ 
tential environment were highly restricted. Social 
environments, on the other hand, provide a much 
peater latitude for creating contingencies that can 
in turn affect one’s own behavior. People can con¬ 
verse on a multiplicity of topics, they can do a 
variety of things, and in other ways their potential 
range of responsiveness is exceedingly broad. In a 
given social interaction the behaviors of the partici¬ 
pants largely determine which aspects of their re¬ 
spective repertoires are actualized and which remain 
unexpressed. Some people bring out the best in 
others, while other individuals have a talent for 
bringing out the worst in those with whom thev 
interact. 

In examining sequential interchanges between 
children, Rausch [ 1965] found that the immediately 
preceding act of one person was the major determi¬ 
nant of the other person’s response. In approximately 
75 per cent of the instances, hostile behavior elicited 
unfriendly responses, whereas cordial antecedent acts 
seldom did. Aggressive children thus created through 
their actions a hostile environment, whereas children 
who displayed friendly interpersonal modes of re¬ 
sponse generated an amicable social milieu. With 
little effort we could readily identify problem-prone 
individuals with aversive styles of behavior that pre¬ 
dictably produce negative social climates wherever 
t ey go. Far from being ruled by an imposed environ¬ 
ment, people play an active role in constructing their 
own reinforcement contingencies through their char- 
acteristic modes of response. 

It might be argued that if each person partly 
creates his own environment there is no one remain¬ 
ing to be influenced. This apparent paradox over¬ 
looks the fact that reciprocity is rarely perfect be¬ 
cause one’s behavior is not the sole determinant of 1 
subsequent events. Situational factors, the roles that 
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* people occupy, and other considerations partly de- 

W , at °“ e can 01 cannot do in response to the 
actions of others. Furthermore, controlling and 

influences usually occur in an altemat- 
g pattern, rather than concurrently, until each par¬ 
ticipant gets what he wants. P 

The operation of reciprocal reinforcement proc- 

Th “tf** Pr ° ducti0n ° f troublesome 
So ^ Point On most oc- 

c^ons ehddren s mild requests go unheeded because 
tlie parent is disinterested or preoccupied with other 
activities If further bids also go unrewarded, the 
S2 generally display progressively more in- 
nse behavior that becomes increasingly aversive to 

Se C S Stage “ “faction sequence 

the child is exercising aversive control over the 

Eoubl VentUall K y £' Par6Dt iS f0rced ‘^inate 
e troublesome behavior by attending to the child 

sLce the chid dng 0bSt7eperO,1S riponsiveness. 

b " n ‘ l!d galnS par< f al atte ntion and the parent 
gan, temporary peace the behavior of both partici- 

Cfit ne r 31 ? V alth ° Ugh tKe i° n g‘temi effects 
enefit neither. As shown in this example, detrimental 

reciprocal systems are often unknowingly created 
and mutuaUy tamed when 

hces evoke deviant behavior, which, due to‘its 
to 6S ’ “ the ^ cond ttions likely 

Interpersonal difficulties are most likely to arise 
hen a person has developed a narrow range of ef- 

hods to force desired actions from others. Nag- 
g ng complamts aggressiveness, thinly veiled threats 
he^lessness sick-role behavior, and emotional ex 
pressions of rejection, suffering, and distress are 

» T anS 0f , COntr0lJm 8 °tflers, especially if 
«e relationship involves some mutual dependence 

s^rt dr"" £ ° ne *"* -nditions ofthTs 
behavi T substan t>ally, depending on whether such 

SZn 3re V,eWed aS b ^P r0duch Of intrapsychic 
disturbances or m terms of their functional value in 

reciprocal'I 6 reSp0nsive,,ess of °*ers. Deleterious . 

by withH m C,a i ,0nS are most effectively modified 
) thdrawing the reinforcement supporting coer- 
-v behavior and by developing morfconsole 
means of securing desired reactions from others 
Because social control involves a two-way process 

md r;S g,y n° UbtfuI tha ' &e Brave Nw Worl'k 

XeloT:r 1] e : er be attainabie ’—w 

io r S in he conditions governing human behav- 
have Wherever you have controllers you 

experi“r erC ° ntT f 0ll , eni 35 WeIL B0th parties anally 
expenence some feeling of powerlessness in achiev¬ 


Social Learning Theory 


ing the outcomes they desire. Parents voice dis- 
couragement because they cannot get their children 
to follow their wishes, whereas children feel con¬ 
strained by their parents from doing what they want. 
■ t universities the administrators, faculty, students 
and alumni all feel that the other constituencies arc 
unduly influential but that they themselves have in¬ 
sufficient power to alter the institutional practices 
In government functioning Congress feels that the 
executive branch has excessive power, and converse- 

f the executlve branc h feels thwarted in achieving 
its programs by congressional countercontrol. What 
ever the levels of interaction might be, no one party 
is able to manipulate the other at will. Indeed re¬ 
ciprocal influence systems are protected by legal and 
social codes designed to prevent easy use ol im 
penous control with impunity. 
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